Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	五ｇ?徇??	100011001101110010000010100001110011111110011100011011010011111100111111	8cdc82873f9c6d3f3f
EUC-JP	五ｇ?徇??	101110001101111010100011111001110011111111010111110011100011111100111111	b8dea3e73fd7ce3f3f
UTF-8	五ｇ낯徇뉒푽	111001001011101010010100111011111011110110000111111010111000001010101111111001011011111010000111111010111000100110010010111011011001000110111101	e4ba94efbd87eb82afe5be87eb8992ed91bd
UHC	五ｇ낯徇뉒푽	111001111110100110100011111001111011001110111000111000101101111110000111111001111011111010001000	e7e9a3e7b3b8e2df87e7be88

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)