Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	蔘社	11100100111100111000111011010000	e4f38ed0
EUC-JP	蔘社	11101000111101011011110011010010	e8f5bcd2
UTF-8	蔘社	111010001001010010011000111001111010010010111110	e89498e7a4be
UHC	蔘社	11011111101110001101111011100100	dfb8dee4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)