Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?P?PB	0011111101010000001111110101000001000010	3f503f5042
SJIS-WIN	癌P癌PB	10001010111000000101000010001010111000000101000001000010	8ae0508ae05042
EUC-JP	癌P癌PB	10110100111000100101000010110100111000100101000001000010	b4e250b4e25042
UTF-8	癌P癌PB	111001111001100110001100010100001110011110011001100011000101000001000010	e7998c50e7998c5042
UHC	癌P癌PB	11100100110111110101000011100100110111110101000001000010	e4df50e4df5042

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
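
The rows in the table above can be reproduced approximately with a few lines of Python. This is only a sketch, not the converter's actual implementation: the codec names are assumptions about how the page's charset labels map onto Python's standard codecs (SJIS-WIN to cp932, UHC to cp949), and encode_row is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row("癌P癌PB", codec)
        print(label, shown, bits, hexed, sep="\t")
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.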