Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	闖ｴ蛻ｻ蟾曰	11101000100011111011010011100101100010001011101111100101101101111001111001001000	e88fb4e588bbe5b79e48
EUC-JP	闖ｴ蛻ｻ蟾曰	111011111110111110001110101101001110100111101000100011101011101111101010101110011101101110101001	efef8eb4e9e88ebbeab9dba9
UTF-8	闖ｴ蛻ｻ蟾曰	111010011001011110010110111011111011110110110100111010001001101110111011111011111011110110111011111010001001111110111110111001101001101110110000	e99796efbdb4e89bbbefbdbbe89fbee69bb0
UHC	闖???蟾曰	111101111110011000111111001111110011111111100000111010101110100011011000	f7e63f3f3fe0eae8d8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)