Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	簔矣耨簔矣瞶	111000101100000111100001111000011110001111010011111000101100000111100001111000011110000111010110	e2c1e1e1e3d3e2c1e1e1e1d6
EUC-JP	簔矣耨簔矣瞶	111001001100001111100010111000111110011011010101111001001100001111100010111000111110001011011000	e4c3e2e3e6d5e4c3e2e3e2d8
UTF-8	簔矣耨簔矣瞶	111001111011000010010100111001111001111110100011111010001000000010101000111001111011000010010100111001111001111110100011111001111001111010110110	e7b094e79fa3e880a8e7b094e79fa3e79eb6
UHC	?矣??矣?	0011111111101011111110000011111100111111111010111111100000111111	3febf83f3febf83f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
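
The table above can be approximated offline with a short Python sketch. It is not the converter's own implementation. The helper name describe and the mapping from the tool's charset labels to Python codec names (for example cp932 for SJIS-WIN and cp949 for UHC) are assumptions made for illustration, and edge cases may differ from the tool. Characters a charset cannot represent are replaced with "?" (byte 3f), which is what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (round-tripped text, binary bit string, hex bit string).

    Hypothetical helper: unencodable characters become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    round_tripped = raw.decode(codec)
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return round_tripped, binary, raw.hex()


if __name__ == "__main__":
    sample = "簔矣耨簔矣瞶"
    print("Charset\tCharacter\tBit string (binary)\tBit string (hexadecimal)")
    for label, codec in CHARSETS.items():
        chars, bits, hexa = describe(sample, codec)
        print(f"{label}\t{chars}\t{bits}\t{hexa}")
```

Running the script prints one tab-separated row per charset in the same column order as the table, so the rows can be compared directly.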