Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????????	001111110011111100111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f3f3f
SJIS-WIN	獄?????音⑤?	100011011001011000111111001111110011111100111111001111111000100110111001100001110100010000111111	8d963f3f3f3f3f89b987443f
EUC-JP	獄?????音??	1011100111110110001111110011111100111111001111110011111110110010101110110011111100111111	b9f63f3f3f3f3fb2bb3f3f
UTF-8	獄멸퀎兩볠섕音⑤젶	111001111000110110000100111010111010100110111000111011011000000010001110111011111010010110111000111010111011001110100000111011001000010010010101111010011001111110110011111000101001000110100100111011001010000010110110	e78d84eba9b8ed808eefa5b8ebb3a0ec8495e99fb3e291a4eca0b6
UHC	獄멸퀎兩볠섕音⑤젶	111010001010101110111000111010101011001110000100111001011011101110010011111001101011110010101100111010111110010110101000111010111010000010101010	e8abb8eab384e5bb93e6bcacebe5a8eba0aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)