Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	逕菩肢縲	1110011110010100100101011110110010001110100010001110001110000000	e79495ec8e88e380
EUC-JP	逕菩肢縲	1110110111110100110010101110111010111011111010001110010111100000	edf4caeebbe8e5e0
UTF-8	逕菩肢縲	111010011000000010010101111010001000111110101001111010001000001010100010111001111011100010110010	e98095e88fa9e882a2e7b8b2
UHC	逕菩肢?	11001100111011111101110011001100111100101011011000111111	ccefdcccf2b63f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)