Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	肢肆度正	1000111010001000111000111110011010010011011110001001000010110011	8e88e3e6937890b3
EUC-JP	肢肆度正	1011101111101000111001101110100011000101110110011100000010110101	bbe8e6e8c5d9c0b5
UTF-8	肢肆度正	111010001000001010100010111010001000001010000110111001011011101010100110111001101010110110100011	e882a2e88286e5baa6e6ada3
UHC	肢肆度正	1111001010110110110111101110101111010011111110001110111111100001	f2b6deebd3f8efe1

SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
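
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name char_codes and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions. Characters a charset cannot represent are replaced with "?" (0x3f), which is assumed to be what produces the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def char_codes(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in char_codes("肢肆度正"):
        print(label, bits, hex_string, sep="\t")
```

Each output row lists the charset label, the encoded bytes as a bit string, and the same bytes in hexadecimal, separated by tabs.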