Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???LB	0011111100111111001111110100110001000010	3f3f3f4c42
SJIS-WIN	辱ｉ?LB	10010000010010101000001010001001001111110100110001000010	904a82893f4c42
EUC-JP	辱ｉ?LB	10111111101010111010001111101001001111110100110001000010	bfaba3e93f4c42
UTF-8	辱ｉ뮈LB	1110100010111110101100011110111110111101100010011110101110101110100010000100110001000010	e8beb1efbd89ebae884c42
UHC	辱ｉ뮈LB	1110100110110100101000111110100110111001101111110100110001000010	e9b4a3e9b9bf4c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)