Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	淨?罰耳	10011111110001000011111110010100101100011000111010101000	9fc43f94b18ea8
EUC-JP	淨?罰耳	11011110110001100011111111001000101100111011110010101010	dec63fc8b3bcaa
UTF-8	淨곈罰耳	111001101011011110101000111010101011001110001000111001111011110110110000111010001000000010110011	e6b7a8eab388e7bdb0e880b3
UHC	淨곈罰耳	1110111111100100101100001110100111011011111010111110110010111100	efe4b0e9dbebecbc

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
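
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper `encode_row` are assumptions chosen for illustration. A character that a charset cannot represent is replaced with "?" (byte 3f), which is why the ISO-8859-1 row reads "????".

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, bit string, hex string) for one charset.

    encode_row is a hypothetical helper name. Characters the charset
    cannot represent are replaced with "?" by errors="replace".
    """
    data = text.encode(codec, errors="replace")
    # Decode again to see what actually survived the conversion.
    survived = data.decode(codec, errors="replace")
    # Eight zero-padded bits per byte, concatenated.
    bits = "".join(f"{byte:08b}" for byte in data)
    return survived, bits, data.hex()


if __name__ == "__main__":
    text = "淨곈罰耳"
    for label, codec in CODECS.items():
        survived, bits, hex_string = encode_row(text, codec)
        print(label, survived, bits, hex_string, sep="\t")
```

Byte values for the legacy charsets depend on the mapping tables each implementation ships, so the SJIS-WIN, EUC-JP, and UHC rows may not match another tool byte for byte.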