Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?i?iB	0011111101101001001111110110100101000010	3f693f6942
SJIS-WIN	蔡i蔡iB	11100100111011110110100111100100111011110110100101000010	e4ef69e4ef6942
EUC-JP	蔡i蔡iB	11101000111100010110100111101000111100010110100101000010	e8f169e8f16942
UTF-8	蔡i蔡iB	111010001001010010100001011010011110100010010100101000010110100101000010	e894a169e894a16942
UHC	蔡i蔡iB	11110011111110010110100111110011111110010110100101000010	f3f969f3f96942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)