Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??Þ??	0011111100111111110111100011111100111111	3f3fde3f3f
SJIS-WIN	懿翁?漿野	100111001111001010001001101001010011111110011111111101111001011011101100	9cf289a53f9ff796ec
EUC-JP	懿翁Þ漿野	1101100011110100101100101010011110001111101010011011000011011110111110011100110011101110	d8f4b2a78fa9b0def9ccee
UTF-8	懿翁Þ漿野	1110011010000111101111111110011110111111100000011100001110011110111001101011110010111111111010011000011110001110	e687bfe7bf81c39ee6bcbfe9878e
UHC	懿翁Þ漿野	11101011111100111110100010111010101010001010110111101101111011001110010110101111	ebf3e8baa8adedece5af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)