Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	癌甦	10001010111000001110000101010011	8ae0e153
EUC-JP	癌甦	10110100111000101110000110110100	b4e2e1b4
UTF-8	癌甦	111001111001100110001100111001111001010010100110	e7998ce794a6
UHC	癌甦	11100100110111111110000111000001	e4dfe1c1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)