Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????{d	00111111001111110011111100111111001111110111101101100100	3f3f3f3f3f7b64
SJIS-WIN	鶯??違?{d	111010011111001000111111001111111000100011100001001111110111101101100100	e9f23f3f88e13f7b64
EUC-JP	鶯??違?{d	111100101111010000111111001111111011000011100011001111110111101101100100	f2f43f3fb0e33f7b64
UTF-8	鶯ㅼ렲違땨{d	1110100110110110101011111110001110000101101111001110101110100000101100101110100110000001100101011110101110010101101010000111101101100100	e9b6afe385bceba0b2e98195eb95a87b64
UHC	鶯ㅼ렲違땨{d	111001011010001110100100111011001000111010111111111010101101111010001011011110000111101101100100	e5a3a4ec8ebfeade8b787b64

SJIS-Win, EUC-JP: Classic character sets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
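
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The function name `describe`, the sample string, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the table.

```python
def describe(text: str, charset: str) -> tuple:
    """Return (displayed text, binary string, hex string) for text in charset."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(charset, errors="replace")
    # Decode the bytes again to see what the charset actually kept.
    shown = raw.decode(charset, errors="replace")
    # Eight bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "鶯{d"  # hypothetical input, not the one used for the table
    for label, codec in CHARSETS.items():
        shown, bits, hexed = describe(sample, codec)
        print(label, shown, bits, hexed, sep="\t")
```

Running the script prints one tab-separated row per charset in the same column order as the table. Other inputs can be tried by changing `sample`.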