Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???\	00111111001111110011111101011100	3f3f3f5c
SJIS-WIN	薔企麻\	11100101010010111000101011101001100101101000001101011100	e54b8ae996835c
EUC-JP	薔企麻\	11101001101011001011010011101011110010111110001101011100	e9acb4ebcbe35c
UTF-8	薔企麻\	11101000100101101001010011100100101111001000000111101001101110101011101101011100	e89694e4bc81e9babb5c
UHC	薔企麻\	11101101111110011101000011101010110110001010101101011100	edf9d0ead8ab5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)