Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	蔕倶▼鐔随宗	111001001111011010001011111001001000000110100101111010000101110010010000100011111000111101000000	e4f68be481a5e85c908f8f40
EUC-JP	蔕倶▼鐔随宗	111010001111100010110110111001101010001010100111111011111011110110111111111011111011110110100001	e8f8b6e6a2a7efbdbfefbda1
UTF-8	蔕倶▼鐔随宗	111010001001010010010101111001011000000010110110111000101001011010111100111010011001000010010100111010011001101010001111111001011010111010010111	e89495e580b6e296bce99094e99a8fe5ae97
UHC	??▼??宗	0011111100111111101000011110010100111111001111111111000011110011	3f3fa1e53f3ff0f3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)