Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?O?	001111110100111100111111	3f4f3f
SJIS-WIN	爲O獏	1110000010101000010011111110000011010001	e0a84fe0d1
EUC-JP	爲O獏	1110000010101010010011111110000011010011	e0aa4fe0d3
UTF-8	爲O獏	11100111100010001011001001001111111001111000110110001111	e788b24fe78d8f
UHC	爲O?	11101010110100110100111100111111	ead34f3f

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
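
The table above can be approximated with a short Python sketch. This is not the code behind this page. It assumes that Python's standard codecs `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that a character a charset cannot represent is replaced with `?` (which is what `errors="replace"` does when encoding). The names `CHARSETS`, `encode_row`, and `build_table` are hypothetical helpers introduced only for illustration.

```python
# Sketch: encode one string in several charsets and show the resulting bytes.
TEXT = "爲O獏"

# Table label -> Python codec name (this mapping is an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are encoded as "?".
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the bytes again shows what survives the round trip.
    return raw.decode(codec), bits, raw.hex()


def build_table(text=TEXT):
    """Build one row per charset, keyed by the table label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in build_table().items():
        print(label, shown, bits, hexa, sep="\t")
```

Running the script prints one tab-separated row per charset in the same column order as the table. A `?` in the Character column means the charset has no code for that character, so the byte 3f was stored in its place.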