Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	茫駈愁鍜	1110010010101001100010111110110110001111010001001110100001000111	e4a98bed8f44e847
EUC-JP	茫駈愁鍜	1110100010101011101101101110111110111101101001011110111110101000	e8abb6efbda5efa8
UTF-8	茫駈愁鍜	111010001000110010101011111010011010011110001000111001101000010010000001111010011000110110011100	e88cabe9a788e68481e98d9c
UHC	茫?愁?	110110001101010000111111111000011111111000111111	d8d43fe1fe3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)