Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????A	0011111100111111001111110011111101000001	3f3f3f3f41
SJIS-WIN	褻､贒ｧA	11100101111101101010010011111011101011111010011101000001	e5f6a4fbafa741
EUC-JP	褻､贒ｧA	11101010111110001000111010100100100011111101111111000011100011101010011101000001	eaf88ea48fdfc38ea741
UTF-8	褻､贒ｧA	11101000101001001011101111101111101111011010010011101000101101001001001011101111101111011010011101000001	e8a4bbefbda4e8b492efbda741
UHC	褻???A	111000001110000100111111001111110011111101000001	e0e13f3f3f41

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)