Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??_?	00111111001111110101111100111111	3f3f5f3f
SJIS-WIN	潁烋_豐	10011111111100011110000001111101010111111110011010110010	9ff1e07d5fe6b2
EUC-JP	潁烋_豐	11011110111100111101111111011110010111111110110010110100	def3dfde5fecb4
UTF-8	潁烋_豐	11100110101111011000000111100111100000111000101101011111111010001011000110010000	e6bd81e7838b5fe8b190
UHC	潁烋_?	111001111011100011111101110011100101111100111111	e7b8fdce5f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)