Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	賃?耳?紙	1001001011000000001111111000111010101000001111111000111010000110	92c03f8ea83f8e86
EUC-JP	賃?耳?紙	1100010011000010001111111011110010101010001111111011101111100110	c4c23fbcaa3fbbe6
UTF-8	賃렎耳렰紙	111010001011001110000011111010111010000010001110111010001000000010110011111010111010000010110000111001111011010010011001	e8b383eba08ee880b3eba0b0e7b499
UHC	賃렎耳렰紙	11101100111111001000111010100100111011001011110010001110101111011111001010110101	ecfc8ea4ecbc8ebdf2b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)