Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?O??	00111111010011110011111100111111	3f4f3f3f
SJIS-WIN	絮O?先	111000110101000001001111001111111001000011100110	e3504f3f90e6
EUC-JP	絮O璿先	1110010110110001010011111000111111001100111010011100000011101000	e5b14f8fcce9c0e8
UTF-8	絮O璿先	11100111101101011010111001001111111001111001001010111111111001011000010110001000	e7b5ae4fe792bfe58588
UHC	絮O璿先	11011111111111000100111111100000110001111110000010111011	dffc4fe0c7e0bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)