To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???E???N\ 001111110011111100111111010001010011111100111111001111110100111001011100 3f3f3f453f3f3f4e5c
SJIS-WIN 狸旦皙E狸旦皙N\ 100100100100101110010010010101011110000110101010010001011001001001001011100100100101010111100001101010100100111001011100 924b9255e1aa45924b9255e1aa4e5c
EUC-JP 狸旦皙E狸旦皙N\ 110000111010110011000011101101101110001010101100010001011100001110101100110000111011011011100010101011000100111001011100 c3acc3b6e2ac45c3acc3b6e2ac4e5c
UTF-8 狸旦皙E狸旦皙N\ 111001111000101110111000111001101001011110100110111001111001101010011001010001011110011110001011101110001110011010010111101001101110011110011010100110010100111001011100 e78bb8e697a6e79a9945e78bb8e697a6e79a994e5c
UHC 狸旦?E狸旦?N\ 11010111111000011101001110101001001111110100010111010111111000011101001110101001001111110100111001011100 d7e1d3a93f45d7e1d3a93f4e5c

SJIS-win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
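
The conversion shown in the table can be approximated in Python as a minimal sketch. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the function name encode_rows are assumptions made for illustration, not the tool's actual implementation. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CODECS.items():
        # Unencodable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = data.decode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_rows("狸旦皙E"):
        print(*row)
```

Each tuple corresponds to one table row: the charset label, the text as it reads back after encoding, the bytes as a binary string, and the same bytes in hexadecimal.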