Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??ThTB	001111110011111101010100011010000101010001000010	3f3f54685442
SJIS-WIN	蛭?ThTB	10010101011001110011111101010100011010000101010001000010	95673f54685442
EUC-JP	蛭?ThTB	11001001110010000011111101010100011010000101010001000010	c9c83f54685442
UTF-8	蛭샷ThTB	11101000100110111010110111101100100000111011011101010100011010000101010001000010	e89badec83b754685442
UHC	蛭샷ThTB	1111001011110100101111001010011001010100011010000101010001000010	f2f4bca654685442
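
The table above can be approximated with a minimal Python sketch. It is not the tool's actual implementation: the helper names `to_bits` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 and Japanese rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def convert(text: str) -> dict:
    """Return {charset label: (bit string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the codec lacks.
        raw = text.encode(codec, errors="replace")
        rows[label] = (to_bits(raw), raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in convert("蛭샷ThTB").items():
        print(label, bits, hexa, sep="\t")
```

Each row is simply the encoded byte sequence shown two ways, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.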

SJIS-Win, EUC-JP: Legacy character encodings used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.