Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	臟?珥?衆?第	1110010001100110001111111110000011100000001111111000111101001111001111111001000111100110	e4663fe0e03f8f4f3f91e6
EUC-JP	臟?珥?衆?第	1110011111000111001111111110000011100010001111111011110110110000001111111100001011101000	e7c73fe0e23fbdb03fc2e8
UTF-8	臟렞珥렮衆렲第	111010001000011110011111111010111010000010011110111001111000111110100101111010111010000010101110111010001010000110000110111010111010000010110010111001111010110010101100	e8879feba09ee78fa5eba0aee8a186eba0b2e7acac
UHC	臟렞珥렮衆렲第	1110110111110100100011101010111111101100101101001000111010111011111100011110101110001110101111111111000010101111	edf48eafecb48ebbf1eb8ebff0af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)