Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	???濟?	001111110011111100111111111000000101101000111111	3f3f3fe05a3f
EUC-JP	珩??濟?	1000111111001011111101110011111100111111110111111011101100111111	8fcbf73f3fdfbb3f
UTF-8	珩쩡렩濟렕	111001111000111110101001111011001010100110100001111010111010000010101001111001101011111110011111111010111010000010010101	e78fa9eca9a1eba0a9e6bf9feba095
UHC	珩쩡렩濟렕	11111011101010001100001011000100100011101011011111110000101011011000111010101010	fba8c2c48eb7f0ad8eaa

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
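
The table above can be approximated offline with a short script. The following is a minimal sketch in Python, not the code behind this page: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper names `describe` and `table` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is what the `?` entries in the table show. Because conversion tables differ between implementations, the legacy-charset rows may not match the page byte for byte.

```python
# Charset labels from the table mapped to Python codec names.
# These pairings are assumptions, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode the bytes again to see what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


def table(text):
    """Build a tab-separated table with one row per charset."""
    rows = ["Charset\tCharacter\tBit string (binary)\tBit string (hexadecimal)"]
    for label, codec in CODECS.items():
        shown, binary, hexa = describe(text, codec)
        rows.append(f"{label}\t{shown}\t{binary}\t{hexa}")
    return "\n".join(rows)


if __name__ == "__main__":
    print(table("珩쩡렩濟렕"))
```

Calling `describe("A", "utf-8")` returns `("A", "01000001", "41")`: one byte, shown first as eight bits and then as two hexadecimal digits.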