Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	荅蛾す證	1110010010100111100010011110100110000010101101111110011010011010	e4a789e982b7e69a
EUC-JP	荅蛾す證	1110100010101001101100101110101110100100101110011110101111111010	e8a9b2eba4b9ebfa
UTF-8	荅蛾す證	111010001000110110000101111010001001101110111110111000111000000110011001111010001010110110001001	e88d85e89bbee38199e8ad89
UHC	?蛾す證	00111111111001001011011010101010101110011111000111111011	3fe4b6aab9f1fb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)