Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???B	00111111001111110011111101000010	3f3f3f42
SJIS-WIN	矣?劇B	111000011110000100111111100011001000000001000010	e1e13f8c8042
EUC-JP	矣?劇B	111000101110001100111111101101111110000001000010	e2e33fb7e042
UTF-8	矣편劇B	11100111100111111010001111101101100011101011100011100101100010101000011101000010	e79fa3ed8eb8e58a8742
UHC	矣편劇B	11101011111110001100011011101101110100001011110001000010	ebf8c6edd0bc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)