To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN 夭狡?nR夭狡?n^[夭狡?nR夭狡?n^[^ 10011010111011101110000011000010001111110110111001010010100110101110111011100000110000100011111101101110010111100101101110011010111011101110000011000010001111110110111001010010100110101110111011100000110000100011111101101110010111100101101101011110 9aeee0c23f6e529aeee0c23f6e5e5b9aeee0c23f6e529aeee0c23f6e5e5b5e
EUC-JP 夭狡?nR夭狡?n^[夭狡?nR夭狡?n^[^ 11010100111100001110000011000100001111110110111001010010110101001111000011100000110001000011111101101110010111100101101111010100111100001110000011000100001111110110111001010010110101001111000011100000110001000011111101101110010111100101101101011110 d4f0e0c43f6e52d4f0e0c43f6e5e5bd4f0e0c43f6e52d4f0e0c43f6e5e5b5e
UTF-8 夭狡섈nR夭狡섈n^[夭狡섈nR夭狡섈n^[^ 1110010110100100101011011110011110001011101000011110110010000100100010000110111001010010111001011010010010101101111001111000101110100001111011001000010010001000011011100101111001011011111001011010010010101101111001111000101110100001111011001000010010001000011011100101001011100101101001001010110111100111100010111010000111101100100001001000100001101110010111100101101101011110 e5a4ade78ba1ec84886e52e5a4ade78ba1ec84886e5e5be5a4ade78ba1ec84886e52e5a4ade78ba1ec84886e5e5b5e
UHC 夭狡섈nR夭狡섈n^[夭狡섈nR夭狡섈n^[^ 1110100011101100110011101110101010111100101010100110111001010010111010001110110011001110111010101011110010101010011011100101111001011011111010001110110011001110111010101011110010101010011011100101001011101000111011001100111011101010101111001010101001101110010111100101101101011110 e8ecceeabcaa6e52e8ecceeabcaa6e5e5be8ecceeabcaa6e52e8ecceeabcaa6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)