Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????B	0011111100111111001111110011111101000010	3f3f3f3f42
SJIS-WIN	蛛ｲ謔営B	1110010110000001101100101110011010000010100010010110001101000010	e581b2e682896342
EUC-JP	蛛ｲ謔営B	111010011110000110001110101100101110101111100010101100011100010001000010	e9e18eb2ebe2b1c442
UTF-8	蛛ｲ謔営B	11101000100110111001101111101111101111011011001011101000101011001001010011100101100101101011011001000010	e89b9befbdb2e8ac94e596b642
UHC	蛛?謔?B	11110001110010000011111111111001110011000011111101000010	f1c83ff9cc3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)