Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	胼合桑讌	1110010001000101100011011000011110001100010010111110011010100101	e4458d878c4be6a5
EUC-JP	胼合桑讌	1110011110100110101110011110011110110111101011001110110010100111	e7a6b9e7b7aceca7
UTF-8	胼合桑讌	111010001000001110111100111001011001000010001000111001101010000110010001111010001010111010001100	e883bce59088e6a191e8ae8c
UHC	?合桑?	001111111111100111101010110111111100110100111111	3ff9eadfcd3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)