To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蠍エ隶悟キ宣、難スィ騾ク蝗∝キ蝉ソッ雎 1110010110110110101101001110100010101110100011001110010110110111100100001110100110100100100100111110111110111101101010001110100110000000101110001110010110011011100000011110010110110111100100001110010010111111101011111110100010110001 e5b6b4e8ae8ce5b790e9a493efbda8e980b8e59b81e5b790e4bfafe8b1
EUC-JP 蠍エ隶悟キ宣、難スィ騾ク蝗∝キ蝉ソッ雎 1110101010111000100011101011010011110000101100001011100011100111100011101011011111000000111010111000111010100100110001101111000110001110101111011000111010101000111100011110000010001110101110001110100111111011101000101110011110001110101101111100000011100110100011101011111110001110101011111111000010110011 eab88eb4f0b0b8e78eb7c0eb8ea4c6f18ebd8ea8f1e08eb8e9fba2e78eb7c0e68ebf8eaff0b3
UTF-8 蠍エ隶悟キ宣、難スィ騾ク蝗∝キ蝉ソッ雎 111010001010000010001101111011111011110110110100111010011001101010110110111001101000001010011111111011111011110110110111111001011010111010100011111011111011110110100100111010011001101110100011111011111011110110111101111011111011110110101000111010011010100010111110111011111011110110111000111010001001110110010111111000101000100010011101111011111011110110110111111010001001110110001001111011111011110110111111111011111011110110101111111010011001101110001110 e8a08defbdb4e99ab6e6829fefbdb7e5aea3efbda4e99ba3efbdbdefbda8e9a8beefbdb8e89d97e2889defbdb7e89d89efbdbfefbdafe99b8e
UHC ???悟?宣?難????蝗∝????雎 00111111001111110011111111100111111101100011111111100000101111100011111111010001111100010011111100111111001111110011111111111100110110011010000111110000001111110011111100111111001111111110111011010001 3f3f3fe7f63fe0be3fd1f13f3f3f3ffcd9a1f03f3f3f3feed1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)