Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	淀弄??界	1001011110000100100110000100110100111111001111111000101001000101	9784984d3f3f8a45
EUC-JP	淀弄?橒界	11001101111001001100111110101110001111111000111111000101101011011011001110100110	cde4cfae3f8fc5adb3a6
UTF-8	淀弄욤橒界	111001101011011110000000111001011011110010000100111011001001101010100100111001101010100110010010111001111001010110001100	e6b780e5bc84ec9aa4e6a992e7958c
UHC	淀弄욤橒界	11101111111000111101011011100111101111111110100011101001111110001100110110100011	efe3d6e7bfe8e9f8cda3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)