Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	逕朧迯ｻ逞	111001111001010010011110010011111110011110001101101110111110011110010111	e7949e4fe78dbbe797
EUC-JP	逕朧迯ｻ逞	11101101111101001101101110110000111011011110110110001110101110111110110111110111	edf4dbb0eded8ebbedf7
UTF-8	逕朧迯ｻ逞	111010011000000010010101111001101001110010100111111010001011111110101111111011111011110110111011111010011000000010011110	e98095e69ca7e8bfafefbdbbe9809e
UHC	逕朧??逞	1100110011101111110101101110100000111111001111111101011011000001	ccefd6e83f3fd6c1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)