Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????BB	00111111001111110011111100111111001111110100001001000010	3f3f3f3f3f4242
SJIS-WIN	湘繒鬘ﾍﾛBB	10001111110000111111101110001111111010011010000111001101110110110100001001000010	8fc3fb8fe9a1cddb4242
EUC-JP	湘繒鬘ﾍﾛBB	10111110110001011000111111010100110101001111001010100011100011101100110110001110110110110100001001000010	bec58fd4d4f2a38ecd8edb4242
UTF-8	湘繒鬘ﾍﾛBB	1110011010111001100110001110011110111001100100101110100110101100100110001110111110111110100011011110111110111110100110110100001001000010	e6b998e7b992e9ac98efbe8defbe9b4242
UHC	湘繒???BB	110111111100111111110001111110010011111100111111001111110100001001000010	dfcff1f93f3f3f4242

SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
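
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the function names (to_bytes, bit_string, convert) are hypothetical, and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the ISO-8859-1 and UHC rows above appear to behave.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bytes(text, codec):
    """Encode text, substituting '?' for characters the codec lacks."""
    return text.encode(codec, errors="replace")


def bit_string(data):
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def convert(text):
    """Return one (charset, character, binary, hexadecimal) row per charset."""
    rows = []
    for label, codec in CODECS.items():
        data = to_bytes(text, codec)
        # Decoding again shows what survives the round trip in that charset.
        rows.append((label, data.decode(codec), bit_string(data), data.hex()))
    return rows


if __name__ == "__main__":
    for row in convert("湘BB"):
        print("\t".join(row))
```

Each output row has the same four tab-separated columns as the table. Exact bytes for characters outside the basic JIS sets may differ between implementations, so results for such characters should be checked against the target system.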