Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????]	00111111001111110011111100111111001111110011111101011101	3f3f3f3f3f3f5d
SJIS-WIN	?功ぴ??を]	00111111100011001111011110000010110100100011111100111111100000101111000001011101	3f8cf782d23f3f82f05d
EUC-JP	?功ぴ??を]	00111111101110001111100110100100110101000011111100111111101001001111001001011101	3fb8f9a4d43f3fa4f25d
UTF-8	룴功ぴ룵핊を]	11101011101000111011010011100101100010101001111111100011100000011011010011101011101000111011010111101101100101011000101011100011100000101001001001011101	eba3b4e58a9fe381b4eba3b5ed958ae382925d
UHC	룴功ぴ룵핊を]	10001111101010011100110111101101101010101101010010001111101010101100000010001111101010101111001001011101	8fa9cdedaad48faac08faaf25d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)