Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	柴館鴫蛇竺	10001110110001001111101111111001100011101011000010001110110101101000111010110001	8ec4fbf98eb08ed68eb1
EUC-JP	柴?鴫蛇竺	101111001100011000111111101111001011001010111100110110001011110010110011	bcc63fbcb2bcd8bcb3
UTF-8	柴館鴫蛇竺	111001101001111110110100111011111010100010101100111010011011010010101011111010001001101110000111111001111010101110111010	e69fb4efa8ace9b4abe89b87e7abba
UHC	柴??蛇竺	1110001111000011001111110011111111011110111011111111010111100111	e3c33f3fdeeff5e7

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
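
The table above can be approximated offline with a short Python sketch. It encodes a string in each charset and prints the bytes as a binary string and a hexadecimal string; a character the charset cannot represent is replaced with "?" (0x3f), as in the rows above. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and the function names describe and table are hypothetical. Results for vendor extension characters may differ from what this page shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (round-tripped text, binary string, hex string) for one codec."""
    # Characters the codec cannot represent are replaced with "?" (0x3f).
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def table(text):
    """Build tab-separated rows in the same column order as the table above."""
    rows = ["Charset\tCharacter\tBit string (binary)\tBit string (hexadecimal)"]
    for label, codec in CODECS.items():
        shown, bits, hexa = describe(text, codec)
        rows.append(f"{label}\t{shown}\t{bits}\t{hexa}")
    return "\n".join(rows)


if __name__ == "__main__":
    print(table("柴"))
```

Calling describe("柴", "utf-8") yields the three bytes e6 9f b4, which match the first 24 bits of the UTF-8 row above.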