Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??i??iB	00111111001111110110100100111111001111110110100101000010	3f3f693f3f6942
SJIS-WIN	悉i悉iB	1111000111111000100011101011101101101001111100011111100010001110101110110110100101000010	f1f88ebb69f1f88ebb6942
EUC-JP	?悉i?悉iB	001111111011110010111101011010010011111110111100101111010110100101000010	3fbcbd693fbcbd6942
UTF-8	悉i悉iB	111011101000010110110011111001101000001010001001011010011110111010000101101100111110011010000010100010010110100101000010	ee85b3e6828969ee85b3e682896942
UHC	?悉i?悉iB	001111111110001111111010011010010011111111100011111110100110100101000010	3fe3fa693fe3fa6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)