Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	膠笈畿藉	1110010001010000100010111000100010001011010001011110010101010011	e4508b888b45e553
EUC-JP	膠笈畿藉	1110011110110001101101011110100010110101101001101110100110110100	e7b1b5e8b5a6e9b4
UTF-8	膠笈畿藉	111010001000011010100000111001111010110010001000111001111001010110111111111010001001011110001001	e886a0e7ac88e795bfe89789
UHC	膠?畿藉	11001110111011110011111111010001101100101110110110111110	ceef3fd1b2edbe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)