Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	渦ｃ?孃	10001001010100011000001010000011001111111001101101101111	895182833f9b6f
EUC-JP	渦ｃ?孃	10110001101100101010001111100011001111111101010111010000	b1b2a3e33fd5d0
UTF-8	渦ｃ굩孃	111001101011100010100110111011111011110110000011111010101011010110101001111001011010110110000011	e6b8a6efbd83eab5a9e5ad83
UHC	渦ｃ굩孃	1110100010111110101000111110001110000010100011111110010110111110	e8bea3e3828fe5be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)