Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??h??	0011111100111111011010000011111100111111	3f3f683f3f
SJIS-WIN	砥砥h砥砥	100100110111010110010011011101010110100010010011011101011001001101110101	937593756893759375
EUC-JP	砥砥h砥砥	110001011101011011000101110101100110100011000101110101101100010111010110	c5d6c5d668c5d6c5d6
UTF-8	砥砥h砥砥	11100111101000001010010111100111101000001010010101101000111001111010000010100101111001111010000010100101	e7a0a5e7a0a568e7a0a5e7a0a5
UHC	砥砥h砥砥	111100101011001011110010101100100110100011110010101100101111001010110010	f2b2f2b268f2b2f2b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)