Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	獄℡ぞ	100011011001011010000111100001001000001010111100	8d96878482bc
EUC-JP	獄?ぞ	1011100111110110001111111010010010111110	b9f63fa4be
UTF-8	獄℡ぞ	111001111000110110000100111000101000010010100001111000111000000110011110	e78d84e284a1e3819e
UHC	獄℡ぞ	111010001010101110100010111001011010101010111110	e8aba2e5aabe

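SJIS-WIN, EUC-JP: Legacy character encodings mainly used for Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f), as in the ISO-8859-1 and EUC-JP rows.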
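
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the helper name char_codes and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. It encodes the input in each charset, substitutes "?" for characters the charset cannot represent, and prints the bytes as binary and hexadecimal strings.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def char_codes(text, charset):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    codec = CODECS[charset]
    # errors="replace" substitutes "?" (0x3f) for unrepresentable characters.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip through this charset.
    shown = raw.decode(codec)
    # Eight binary digits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for name in CODECS:
        shown, bits, hexa = char_codes("獄℡ぞ", name)
        print(name, shown, bits, hexa, sep="\t")
```

Results for the legacy charsets depend on the mapping tables shipped with the Python codecs, so they may differ from the tool's output for characters that have vendor-specific assignments.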