Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	蟒｣蝨假ｽ	1110010110111011101000111110010110011100100110001110111110111101	e5bba3e59c98efbd
EUC-JP	蟒｣蝨假ｽ	11101010101111011000111010100011111010011111110011010000111100011000111010111101	eabd8ea3e9fcd0f18ebd
UTF-8	蟒｣蝨假ｽ	111010001001111110010010111011111011110110100011111010001001110110101000111001011000000110000111111011111011110110111101	e89f92efbda3e89da8e58187efbdbd
UHC	??蝨假?	00111111001111111110001110100100110010101010001100111111	3f3fe3a4caa33f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)