Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	魘閲魘鋭	1110100110110100100010010111101111101001101101001000100101110011	e9b4897be9b48973
EUC-JP	魘閲魘鋭	1111001010110110101100011101110011110010101101101011000111010100	f2b6b1dcf2b6b1d4
UTF-8	魘閲魘鋭	111010011010110110011000111010011001011010110010111010011010110110011000111010011000101110101101	e9ad98e996b2e9ad98e98bad
UHC	????	00111111001111110011111100111111	3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)