Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	逑ｦ繧苦ｵｧ	111001111001001110100110111000111000001010001011111010101011010110100111	e793a6e3828beab5a7
EUC-JP	逑ｦ繧苦ｵｧ	111011011111001110001110101001101110010111100010101101101110110010001110101101011000111010100111	edf38ea6e5e2b6ec8eb58ea7
UTF-8	逑ｦ繧苦ｵｧ	111010011000000010010001111011111011110110100110111001111011100110100111111010001000101110100110111011111011110110110101111011111011110110100111	e98091efbda6e7b9a7e88ba6efbdb5efbda7
UHC	逑??苦??	1100111111000111001111110011111111001101110010000011111100111111	cfc73f3fcdc83f3f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
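
The binary and hexadecimal columns in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the codec names in `CODECS` (for example `cp932` for SJIS-WIN and `cp949` for UHC) and the helper names `to_bytes`, `bit_string`, and `describe` are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the ISO-8859-1 and UHC rows.

```python
# Map the charset labels used in the table to Python codec names.
# These mappings are assumptions; the page may use a different library.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bytes(text, charset):
    """Encode text in the given charset, substituting '?' for
    characters that the charset cannot represent."""
    return text.encode(CODECS[charset], errors="replace")


def bit_string(data):
    """Render each byte as 8 binary digits and concatenate them."""
    return "".join(f"{byte:08b}" for byte in data)


def describe(text):
    """Return (charset, binary string, hex string) for every charset."""
    rows = []
    for charset in CODECS:
        data = to_bytes(text, charset)
        rows.append((charset, bit_string(data), data.hex()))
    return rows


if __name__ == "__main__":
    # Same input string as the table above.
    for charset, bits, hexa in describe("逑ｦ繧苦ｵｧ"):
        print(charset, bits, hexa, sep="\t")
```

Each row is produced independently from the same input string, so a charset that lacks a character yields 3f at that position while the other rows are unaffected.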