Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	????ヨ?	00111111001111110011111100111111100000111000100000111111	3f3f3f3f83883f
EUC-JP	???嫄ヨ?	001111110011111100111111100011111011101010100001101001011110100000111111	3f3f3f8fbaa1a5e83f
UTF-8	歷몃씮嫄ヨ린	111011111010011010001100111010111010101010000011111011001001010010101110111001011010101110000100111000111000001110101000111010111010011010110000	efa68cebaa83ec94aee5ab84e383a8eba6b0
UHC	歷몃씮嫄ヨ린	111001101011100010111000111010111001110110111111111010101011000110101011111010001011100010110000	e6b8b8eb9dbfeab1abe8b8b0

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
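
A table of this kind can be approximated with a short Python sketch. This is not the converter's own code. It assumes that Python's `cp932` codec stands in for SJIS-WIN and `cp949` for UHC, and that characters a charset cannot represent are replaced with "?" (byte 3f), which is how they appear in the rows above. The names `CHARSETS` and `describe` are made up for this illustration.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions made for this sketch.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text):
    """Return one (charset, character, binary, hexadecimal) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (byte 0x3f).
        raw = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = raw.decode(codec, errors="replace")
        # Each byte is written as eight binary digits, most significant first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for row in describe("ヨ"):
        print("\t".join(row))
```

Calling `describe` with a longer string yields one tab-separated row per charset in the same column order as the table; whether every byte matches the converter's output depends on how closely these codecs match the ones it uses.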