Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??i??iB	00111111001111110110100100111111001111110110100101000010	3f3f693f3f6942
SJIS-WIN	遽･i遽･iB	111001111010111110100101011010011110011110101111101001010110100101000010	e7afa569e7afa56942
EUC-JP	遽･i遽･iB	1110111010110001100011101010010101101001111011101011000110001110101001010110100101000010	eeb18ea569eeb18ea56942
UTF-8	遽･i遽･iB	111010011000000110111101111011111011110110100101011010011110100110000001101111011110111110111101101001010110100101000010	e981bdefbda569e981bdefbda56942
UHC	遽?i遽?iB	110010111110100000111111011010011100101111101000001111110110100101000010	cbe83f69cbe83f6942
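
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the mapping from charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name to_bits are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "iB"  # any short string
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bits(sample, codec)
        print(f"{label}\t{sample}\t{binary}\t{hexadecimal}")
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.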

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).