Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??D??D^	00111111001111110100010000111111001111110100010001011110	3f3f443f3f445e
SJIS-WIN	嘆他D嘆他D^	1001001001010001100100011011110001000100100100100101000110010001101111000100010001011110	925191bc44925191bc445e
EUC-JP	嘆他D嘆他D^	1100001110110010110000101011111001000100110000111011001011000010101111100100010001011110	c3b2c2be44c3b2c2be445e
UTF-8	嘆他D嘆他D^	111001011001100010000110111001001011101110010110010001001110010110011000100001101110010010111011100101100100010001011110	e59886e4bb9644e59886e4bb96445e
UHC	嘆他D嘆他D^	1111011110100011111101101110001001000100111101111010001111110110111000100100010001011110	f7a3f6e244f7a3f6e2445e

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
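
The rows of the table can be approximated offline with a few lines of Python. This is a minimal sketch, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_row` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "嘆他D嘆他D^"
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string, sep="\t")
```

For example, `encode_row("D", "utf-8")` yields the bit string `01000100` and the hex string `44`, which matches the single-byte "D" visible in every row of the table.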