Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	謐ｧ蜀ｵ逖	1110011010001101101001111110010110000110101101011110011110011000	e68da7e586b5e798
EUC-JP	謐ｧ蜀ｵ逖	11101011111011011000111010100111111010011110011010001110101101011110110111111000	ebed8ea7e9e68eb5edf8
UTF-8	謐ｧ蜀ｵ逖	111010001010110010010000111011111011110110100111111010001001110010000000111011111011110110110101111010011000000010010110	e8ac90efbda7e89c80efbdb5e98096
UHC	謐?蜀??	11011010110011010011111111110101101110010011111100111111	dacd3ff5b93f3f

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
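
A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the converter's own implementation: the function name encode_table is hypothetical, and the Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumed to be close equivalents of the charsets listed. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the ISO-8859-1 row.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for each charset.

    Assumption: Python's codec names stand in for the charsets in the table
    (cp932 ~ SJIS-Win, cp949 ~ UHC).
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(name, errors="replace")
        # Each byte becomes 8 binary digits, concatenated without separators.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, binary, hexadecimal in encode_table("あ"):
        print(name, binary, hexadecimal, sep="\t")
```

The same character yields a different byte sequence in each charset, which is why text decoded with the wrong charset turns into unreadable characters.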