Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	薔?魚???莊	11100101010010110011111110001011100110110011111100111111001111111110010010110101	e54b3f8b9b3f3f3fe4b5
EUC-JP	薔?魚???莊	11101001101011000011111110110101111110110011111100111111001111111110100010110111	e9ac3fb5fb3f3f3fe8b7
UTF-8	薔렡魚펼렠렩莊	111010001001011010010100111010111010000010100001111010011010110110011010111011011000111010111100111010111010000010100000111010111010000010101001111010001000111010001010	e89694eba0a1e9ad9aed8ebceba0a0eba0a9e88e8a
UHC	薔렡魚펼렠렩莊	1110110111111001100011101011001011100101111000001100011011101110100011101011000110001110101101111110110111110110	edf98eb2e5e0c6ee8eb18eb7edf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)