Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	邵喜｢壱棠	111001111011100010001010111011001010001010001000111010111001111010101001	e7b88aeca288eb9ea9
EUC-JP	邵喜｢壱棠	11101110101110101011010011101110100011101010001010110000111011011101110010101011	eebab4ee8ea2b0eddcab
UTF-8	邵喜｢壱棠	111010011000001010110101111001011001011010011100111011111011110110100010111001011010001110110001111001101010001110100000	e982b5e5969cefbda2e5a3b1e6a3a0
UHC	邵喜??棠	1110000111010000111111011110110000111111001111111101001111010110	e1d0fdec3f3fd3d6

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
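
The conversion shown in the table can be approximated offline with a minimal Python sketch. It rests on several assumptions: the helper name encode_row and the CHARSETS mapping are hypothetical, and the Python codec names cp932, euc_jp and cp949 are used as presumed equivalents of the SJIS-WIN, EUC-JP and UHC labels above. This page's own converter may use different mapping tables, so results can differ for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the ISO-8859-1 and UHC rows.

```python
# Illustrative sketch only: the codec names are Python's and are assumed
# to correspond to the charset labels used in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "邵喜｢壱棠"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string, sep="\t")
```

Comparing the lengths of the hexadecimal strings shows how the charsets differ: in the table, UTF-8 spends three bytes on each of the five characters, while SJIS-WIN uses two bytes per kanji and a single byte for the half-width ｢.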