To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鳥戡??遭???製 10010010101110011001110101000001001111110011111110010001100110000011111100111111001111111001000010111011 92b99d413f3f91983f3f3f90bb
EUC-JP 鳥戡??遭???製 11000100101110111101100110100010001111110011111111000001111110000011111100111111001111111100000010111101 c4bbd9a23f3fc1f83f3f3fc0bd
UTF-8 鳥戡렰렢遭ㆁ렰렡製 111010011011001110100101111001101000100010100001111010111010000010110000111010111010000010100010111010011000000110101101111000111000011010000001111010111010000010110000111010111010000010100001111010001010001110111101 e9b3a5e688a1eba0b0eba0a2e981ade38681eba0b0eba0a1e8a3bd
UHC 鳥戡렰렢遭ㆁ렰렡製 111100001110100011001010111100011000111010111101100011101011001111110000111001001010010011110001100011101011110110001110101100101111000010110010 f0e8caf18ebd8eb3f0e4a4f18ebd8eb2f0b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)