Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	宍贖丈ﾔ借	100011101011001111100110110111001000111111100100110101001000111011011000	8eb3e6dc8fe4d48ed8
EUC-JP	宍贖丈ﾔ借	10111100101101011110110011011110101111101110011010001110110101001011110011011010	bcb5ecdebee68ed4bcda
UTF-8	宍贖丈ﾔ借	111001011010111010001101111010001011010010010110111001001011100010001000111011111011111010010100111001011000000010011111	e5ae8de8b496e4b888efbe94e5809f
UHC	?贖丈?借	0011111111100001110110111110110111011011001111111111001110101000	3fe1dbeddb3ff3a8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)