Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?Lh?L	0011111101001100011010000011111101001100	3f4c683f4c
SJIS-WIN	汚Lh汚L	10001001100110000100110001101000100010011001100001001100	89984c6889984c
EUC-JP	汚Lh汚L	10110001111110000100110001101000101100011111100001001100	b1f84c68b1f84c
UTF-8	汚Lh汚L	111001101011000110011010010011000110100011100110101100011001101001001100	e6b19a4c68e6b19a4c
UHC	汚Lh汚L	11100111111111010100110001101000111001111111110101001100	e7fd4c68e7fd4c

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
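
The table rows can be reproduced offline with a short script. The following is a minimal sketch, not the converter's actual implementation. The codec names are Python's and are assumed to correspond to the table's labels (cp932 for SJIS-WIN, cp949 for UHC), and the helper names encode_row and build_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is why the ISO-8859-1 row shows ?Lh?L.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (text as the charset sees it, binary string, hex string)."""
    # errors="replace" turns unencodable characters into "?" instead of raising.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, with leading zeros kept.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return raw.decode(codec), bits, raw.hex()


def build_table(text):
    """Encode the same text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in build_table("汚Lh汚L").items():
        print(label, shown, bits, hex_string, sep="\t")
```

Comparing the hexadecimal column across rows shows that the ASCII letters L and h keep the same bytes (4c, 68) in every charset listed, while 汚 takes two bytes in SJIS-WIN, EUC-JP, and UHC and three bytes in UTF-8.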