Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??I??W	001111110011111101001001001111110011111101010111	3f3f493f3f57
SJIS-WIN	鴆自I鴆自W	11101001111011111000111010101001010010011110100111101111100011101010100101010111	e9ef8ea949e9ef8ea957
EUC-JP	鴆自I鴆自W	11110010111100011011110010101011010010011111001011110001101111001010101101010111	f2f1bcab49f2f1bcab57
UTF-8	鴆自I鴆自W	1110100110110100100001101110100010000111101010100100100111101001101101001000011011101000100001111010101001010111	e9b486e887aa49e9b486e887aa57
UHC	?自I?自W	0011111111101101101110110100100100111111111011011011101101010111	3fedbb493fedbb57

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)