Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?C?	001111110100001100111111	3f433f
SJIS-WIN	曙C遜	1000111110001100010000111001000110111011	8f8c4391bb
EUC-JP	曙C遜	1011110111101100010000111100001010111101	bdec43c2bd
UTF-8	曙C遜	11100110100110111001100101000011111010011000000110011100	e69b9943e9819c
UHC	曙C遜	1101111111110101010000111110000111100001	dff543e1e1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)