Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	紙?啼??伎	100011101000011000111111100110100110010100111111001111111000101011101010	8e863f9a653f3f8aea
EUC-JP	紙?啼??伎	101110111110011000111111110100111100011000111111001111111011010011101100	bbe63fd3c63f3fb4ec
UTF-8	紙렏啼재렓伎	111001111011010010011001111010111010000010001111111001011001010110111100111011001001111010101100111010111010000010010011111001001011110010001110	e7b499eba08fe595bcec9eaceba093e4bc8e
UHC	紙렏啼재렓伎	111100101011010110001110101001011111000010100110110000001110011110001110101010001101000011101011	f2b58ea5f0a6c0e78ea8d0eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)