Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???T	00111111001111110011111101010100	3f3f3f54
SJIS-WIN	鍾ｹ示T	100011111101111110111001100011101010011001010100	8fdfb98ea654
EUC-JP	鍾ｹ示T	10111110111000011000111010111001101111001010100001010100	bee18eb9bca854
UTF-8	鍾ｹ示T	11101001100011011011111011101111101111011011100111100111101001001011101001010100	e98dbeefbdb9e7a4ba54
UHC	鍾?示T	111100011010001100111111111000111100011001010100	f1a33fe3c654
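
The rows above can be approximated outside the page with a short Python sketch. It encodes the same input string in each charset and prints the resulting bytes as binary and hexadecimal. The codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) and the helper name bit_dump are assumptions chosen for illustration and are not necessarily what this page uses internally. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_dump(text, codec):
    """Encode text with codec and return (binary string, hex string).

    bit_dump is a hypothetical helper name. Unencodable characters
    are replaced with '?' instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    # Same input string as in the table above.
    for label, codec in CODECS.items():
        binary, hexa = bit_dump("鍾ｹ示T", codec)
        print(label, binary, hexa, sep="\t")
```

Comparing the hexadecimal column across rows shows that the same character can take one, two, or three bytes depending on the charset, while the ASCII letter T is 0x54 in every charset listed here.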

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.