Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???????P	0011111100111111001111110011111100111111001111110011111101010000	3f3f3f3f3f3f3f50
SJIS-WIN	???????P	0011111100111111001111110011111100111111001111110011111101010000	3f3f3f3f3f3f3f50
EUC-JP	???????P	0011111100111111001111110011111100111111001111110011111101010000	3f3f3f3f3f3f3f50
UTF-8	혢찼혝혙혗혣혺P	11101101100110001010001011101100101100001011110011101101100110001001110111101101100110001001100111101101100110001001011111101101100110001010001111101101100110001011101001010000	ed98a2ecb0bced989ded9899ed9897ed98a3ed98ba50
UHC	혢찼혝혙혗혣혺P	110000101000101111000011101000011100001010000111110000101000010011000010100000101100001010001100110000101001111101010000	c28bc3a1c287c284c282c28cc29f50

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
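
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the helper name `charcode_table` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1, SJIS-WIN, and EUC-JP rows above show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def charcode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexa in charcode_table("P"):
        print(f"{label}\t{binary}\t{hexa}")
```

Calling `charcode_table` with a Japanese or Korean string instead of "P" shows how the same characters map to different byte sequences, or to "?" where a charset has no mapping.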