Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	也??揖?也	100101101110011100111111001111111001011101001011001111111001011011100111	96e73f3f974b3f96e7
EUC-JP	也??揖?也	110011001110100100111111001111111100110110101100001111111100110011101001	cce93f3fcdac3fcce9
UTF-8	也㏓뜉揖쟦也	111001001011100110011111111000111000111110010011111010111001110010001001111001101000111110010110111011001001111110100110111001001011100110011111	e4b99fe38f93eb9c89e68f96ec9fa6e4b99f
UHC	也㏓뜉揖쟦也	111001011010010110100111111010111000110110001100111010111110011110100000011010001110010110100101	e5a5a7eb8d8cebe7a068e5a5

SJIS-WIN, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
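
The table above can be approximated offline with a few lines of Python. This is a minimal sketch, not the converter's actual implementation. The mapping from the page's charset names to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) is an assumption, and the helper name `describe` is hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which is why the ISO-8859-1 row consists only of 3f bytes.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# The converter itself may use a different library, so edge cases can differ.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (round-tripped text, bit string, hex string) for text in codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    sample = "也㏓뜉揖쟦也"
    print("Charset\tCharacter\tBit string (binary)\tBit string (hexadecimal)")
    for label, codec in CHARSETS.items():
        shown, bits, hexa = describe(sample, codec)
        print(f"{label}\t{shown}\t{bits}\t{hexa}")
```

Each byte is printed as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.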