To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 äè¾ëŠëè†h 11100100111010001011111011101011100011011000101011101011111010001000011001101000 e4e8beeb8d8aebe88668
SJIS-WIN ?????????h 00111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f68
EUC-JP äè?ë??ëè?h 1000111110101011101000111000111110101011101100100011111110001111101010111011001100111111001111111000111110101011101100111000111110101011101100100011111101101000 8faba38fabb23f8fabb33f3f8fabb38fabb23f68
UTF-8 äè¾ëŠëè†h 11000011101001001100001110101000110000101011111011000011101010111100001010001101110000101000101011000011101010111100001110101000110000101000011001101000 c3a4c3a8c2bec3abc28dc28ac3abc3a8c28668
UHC ??¾??????h 0011111100111111101010001111101000111111001111110011111100111111001111110011111101101000 3f3fa8fa3f3f3f3f3f3f68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)