Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???J^	0011111100111111001111110100101001011110	3f3f3f4a5e
SJIS-WIN	村多端J^	1001000110111010100100011011110110010010010110110100101001011110	91ba91bd925b4a5e
EUC-JP	村多端J^	1100001010111100110000101011111111000011101111000100101001011110	c2bcc2bfc3bc4a5e
UTF-8	村多端J^	1110011010011101100100011110010110100100100110101110011110101011101011110100101001011110	e69d91e5a49ae7abaf4a5e
UHC	村多端J^	1111010110111101110100101111110111010011101011100100101001011110	f5bdd2fdd3ae4a5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
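
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind the page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the helper name bit_strings. Characters that a charset cannot represent are replaced with "?", which is what the ISO-8859-1 row appears to show.

```python
TEXT = "村多端J^"

# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Encode text with codec; return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


for label, codec in CODECS.items():
    binary, hexadecimal = bit_strings(TEXT, codec)
    print(label, binary, hexadecimal, sep="\t")
```

Each printed row has the charset label, the binary bit string, and the hexadecimal bit string, separated by tabs. The same characters occupy a different number of bytes depending on the charset: two bytes per kanji in SJIS-WIN and EUC-JP, three in UTF-8.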