Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????~	001111110011111100111111001111110011111101111110	3f3f3f3f3f7e
SJIS-WIN	莵赦杓赦鴫~	1110010010110111100011101100110110001110110110111000111011001101100011101011000001111110	e4b78ecd8edb8ecd8eb07e
EUC-JP	莵赦杓赦鴫~	1110100010111001101111001100111110111100110111011011110011001111101111001011001001111110	e8b9bccfbcddbccfbcb27e
UTF-8	莵赦杓赦鴫~	11101000100011101011010111101000101101011010011011100110100111011001001111101000101101011010011011101001101101001010101101111110	e88eb5e8b5a6e69d93e8b5a6e9b4ab7e
UHC	?赦杓赦?~	001111111101111011110101111110001111010111011110111101010011111101111110	3fdef5f8f5def53f7e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)