Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	妖企?弔	10010111011001001000101011101001001111111001001010100010	97648ae93f92a2
EUC-JP	妖企?弔	11001101110001011011010011101011001111111100010010100100	cdc5b4eb3fc4a4
UTF-8	妖企㉢弔	111001011010011010010110111001001011110010000001111000111000100110100010111001011011110010010100	e5a696e4bc81e389a2e5bc94
UHC	妖企㉢弔	1110100011101101110100001110101010101000101100111111000011000000	e8edd0eaa8b3f0c0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)