Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??ÆB	00111111001111111100011001000010	3f3fc642
SJIS-WIN	巍私?B	100110111101100110001110100001000011111101000010	9bd98e843f42
EUC-JP	巍私ÆB	1101011011011011101110111110010010001111101010011010000101000010	d6dbbbe48fa9a142
UTF-8	巍私ÆB	111001011011011110001101111001111010011110000001110000111000011001000010	e5b78de7a781c38642
UHC	巍私ÆB	11101000111001001101111011100111101010001010000101000010	e8e4dee7a8a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)