Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	證邁?弔?	1110011010011010111001111011000000111111100100101010001000111111	e69ae7b03f92a23f
EUC-JP	證邁?弔?	1110101111111010111011101011001000111111110001001010010000111111	ebfaeeb23fc4a43f
UTF-8	證邁갬弔렒	111010001010110110001001111010011000001010000001111010101011000010101100111001011011110010010100111010111010000010010010	e8ad89e98281eab0ace5bc94eba092
UHC	證邁갬弔렒	11110001111110111101100011100100101100001011011111110000110000001000111010100111	f1fbd8e4b0b7f0c08ea7
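
The rows above can be approximated with a short Python sketch. It assumes that Python's built-in codecs `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the charsets ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC; the converter's own mapping tables are not known, so individual bytes may differ. The helper name `bit_strings` is hypothetical. Characters a charset cannot represent are replaced with "?" (byte 3f), which is why "?" appears in some rows.

```python
def bit_strings(text, charset):
    """Encode text in charset and return (round-tripped text, binary, hex)."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(charset, errors="replace")
    # Decode the bytes again to show what survives in that charset.
    shown = raw.decode(charset)
    # Render every byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return shown, binary, raw.hex()


if __name__ == "__main__":
    sample = "證邁갬弔렒"
    # Left: label used in the table. Right: assumed Python codec name.
    charsets = [
        ("ISO-8859-1", "latin-1"),
        ("SJIS-WIN", "cp932"),
        ("EUC-JP", "euc_jp"),
        ("UTF-8", "utf-8"),
        ("UHC", "cp949"),
    ]
    print("Charset", "Character", "Bit string (binary)",
          "Bit string (hexadecimal)", sep="\t")
    for label, codec in charsets:
        shown, binary, hexadecimal = bit_strings(sample, codec)
        print(label, shown, binary, hexadecimal, sep="\t")
```

Running the script prints one tab-separated row per charset in the same column order as the table.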

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).