Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	雅??乙	100010011110101100111111001111111000100110110011	89eb3f3f89b3
EUC-JP	雅??乙	101100101110110100111111001111111011001010110101	b2ed3f3fb2b5
UTF-8	雅뚭쑴乙	111010011001101110000101111010111001101010101101111011001001000110110100111001001011100110011001	e99b85eb9aadec91b4e4b999
UHC	雅뚭쑴乙	1110010010111010100011001110101010111110101010011110101111100000	e4ba8ceabea9ebe0

SJIS-WIN, EUC-JP: Legacy character encodings for Japanese, mainly used on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
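
The conversion shown in the table can be approximated with Python's built-in codecs. This is a minimal sketch, not the page's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `to_bytes`, `bit_string`, and `describe` are assumptions chosen for illustration. It also assumes that characters missing from a charset are replaced with "?" (0x3f), which matches the `3f` bytes in the rows above.

```python
# Sketch: re-create the table rows with Python's standard codecs.
# The mapping from the page's labels to Python codec names is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bytes(text, codec):
    """Encode text; characters the charset lacks become '?' (0x3f)."""
    return text.encode(codec, errors="replace")


def bit_string(data):
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def describe(text):
    """Return (charset, character, binary, hexadecimal) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        data = to_bytes(text, codec)
        # Decode again to show what survives the round trip in that charset.
        shown = data.decode(codec)
        rows.append((label, shown, bit_string(data), data.hex()))
    return rows
```

Calling `describe("雅뚭쑴乙")` returns one tuple per charset. The hangul characters have no code point in ISO-8859-1, CP932, or EUC-JP, so they come back as "?" in those rows, while UTF-8 keeps all four characters intact.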