Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???@h	0011111100111111001111110100000001101000	3f3f3f4068
SJIS-WIN	癌屑厄@h	1000101011100000100010111111101110010110111011110100000001101000	8ae08bfb96ef4068
EUC-JP	癌屑厄@h	1011010011100010101101101111110111001100111100010100000001101000	b4e2b6fdccf14068
UTF-8	癌屑厄@h	1110011110011001100011001110010110110001100100011110010110001110100001000100000001101000	e7998ce5b191e58e844068
UHC	癌屑厄@h	1110010011011111111000001101101011100100111110000100000001101000	e4dfe0dae4f84068

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
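
The table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, and to_bits is a hypothetical helper name. Characters a charset cannot represent are replaced with "?" (0x3f), which is why the ISO-8859-1 row shows "???" for the three kanji.

```python
# Sample text taken from the table above.
SAMPLE = "癌屑厄@h"

# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


for label, codec in CODECS.items():
    binary, hexadecimal = to_bits(SAMPLE, codec)
    print(label, binary, hexadecimal, sep="\t")
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.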