To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 碍??曖??娥 10001010010101100011111100111111100111100100001000111111001111111001101101001101 8a563f3f9e423f3f9b4d
EUC-JP 碍??曖??娥 10110011101101110011111100111111110110111010001100111111001111111101010110101110 b3b73f3fdba33f3fd5ae
UTF-8 碍걦뢙曖뤶귺娥 111001111010001010001101111010101011000110100110111010111010001010011001111001101001101110010110111010111010010010110110111010101011011110111010111001011010100010100101 e7a28deab1a6eba299e69b96eba4b6eab7bae5a8a5
UHC 碍걦뢙曖뤶귺娥 1110010011110100100000011000111110001111010101001110010011110010100011111110010010000011010000011110010010110000 e4f4818f8f54e4f28fe48341e4b0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)