Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	倭▼?掖	10011000011000001000000110100101001111111001110101110100	986081a53f9d74
EUC-JP	倭▼?掖	11001111110000011010001010100111001111111101100111010101	cfc1a2a73fd9d5
UTF-8	倭▼뜐掖	111001011000000010101101111000101001011010111100111010111001110010010000111001101000111010010110	e580ade296bceb9c90e68e96
UHC	倭▼뜐掖	1110100011011110101000011110010110001101100100111110010011111010	e8dea1e58d93e4fa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)