To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 蠍エ隶悟キ宣ァ募 11100101101101101011010011101000101011101000110011100101101101111001000011101001101001111001010111100101 e5b6b4e8ae8ce5b790e9a795e5
EUC-JP 蠍エ隶悟キ宣ァ募 11101010101110001000111010110100111100001011000010111000111001111000111010110111110000001110101110001110101001111100101011100111 eab88eb4f0b0b8e78eb7c0eb8ea7cae7
UTF-8 蠍エ隶悟キ宣ァ募 111010001010000010001101111011111011110110110100111010011001101010110110111001101000001010011111111011111011110110110111111001011010111010100011111011111011110110100111111001011000101110011111 e8a08defbdb4e99ab6e6829fefbdb7e5aea3efbda7e58b9f
UHC ???悟?宣?募 0011111100111111001111111110011111110110001111111110000010111110001111111101100110110100 3f3f3fe7f63fe0be3fd9b4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)