Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????^	001111110011111100111111001111110011111101011110	3f3f3f3f3f5e
SJIS-WIN	獰ｉ?彦?^	111000001101011010000010100010010011111110010101010001100011111101011110	e0d682893f95463f5e
EUC-JP	獰ｉ?彦?^	111000001101100010100011111010010011111111001001101001110011111101011110	e0d8a3e93fc9a73f5e
UTF-8	獰ｉ쉹彦푆^	11100111100011011011000011101111101111011000100111101100100010011011100111100101101111011010011011101101100100011000011001011110	e78db0efbd89ec89b9e5bda6ed91865e
UHC	獰ｉ쉹彦푆^	1110011110111110101000111110100110011010100011111110010111101001101111100100110101011110	e7bea3e99a8fe5e9be4d5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
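
A conversion like the one in the table can be approximated with a short script. The following is a minimal sketch, not this page's actual implementation: the helper names `charcode_row` and `charcode_table` are hypothetical, and it assumes Python's `cp932` and `cp949` codecs are close enough stand-ins for SJIS-WIN and UHC. Characters a charset cannot represent are replaced with `?` (0x3f), matching the `?` entries in the table; byte values for individual characters may differ slightly between implementations.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions, not confirmed by the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def charcode_row(text, codec):
    """Return (round-tripped text, binary bit string, hex string) for one codec."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survives in this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte is rendered as 8 bits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


def charcode_table(text):
    """Build one row per charset, keyed by the charset label."""
    return {label: charcode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    print("Charset", "Character", "Bit string (binary)",
          "Bit string (hexadecimal)", sep="\t")
    for label, (shown, binary, hexa) in charcode_table("獰ｉ쉹彦푆^").items():
        print(label, shown, binary, hexa, sep="\t")
```

Running the script prints one tab-separated row per charset, in the same column order as the table above. The round trip through `decode` is what produces the Character column: it shows which characters each charset was able to keep.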