Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	肄雁欝苟	1110001111100101100010101110010110001001010101001110010010010001	e3e58ae58954e491
EUC-JP	肄雁欝苟	1110011011100111101101001110011110110001101101011110011111110001	e6e7b4e7b1b5e7f1
UTF-8	肄雁欝苟	111010001000001010000100111010011001101110000001111001101010110010011101111010001000101110011111	e88284e99b81e6ac9de88b9f
UHC	肄雁?苟	11101100101111011110010011010010001111111100111111000010	ecbde4d23fcfc2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)