Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	釮ﾄ雍ｭ鴉	1111101110111110110001001110100010110100101011011110100111101011	fbbec4e8b4ade9eb
EUC-JP	釮ﾄ雍ｭ鴉	1000111111100011101100001000111011000100111100001011011010001110101011011111001011101101	8fe3b08ec4f0b68eadf2ed
UTF-8	釮ﾄ雍ｭ鴉	111010011000011110101110111011111011111010000100111010011001101110001101111011111011110110101101111010011011010010001001	e987aeefbe84e99b8defbdade9b489
UHC	??雍?鴉	00111111001111111110100010111100001111111110010010111100	3f3fe8bc3fe4bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)