To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????z????????zB 00111111001111110011111100111111001111110011111100111111001111110111101000111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f7a42
SJIS-WIN 蠍エ隴溷キ撰スェz蠍エ隴溷キ撰スェzB 111001011011011010110100111010001010110110011111111001011011011110010000111011111011110110101010011110101110010110110110101101001110100010101101100111111110010110110111100100001110111110111101101010100111101001000010 e5b6b4e8ad9fe5b790efbdaa7ae5b6b4e8ad9fe5b790efbdaa7a42
EUC-JP 蠍エ隴溷キ撰スェz蠍エ隴溷キ撰スェzB 1110101010111000100011101011010011110000101011111101111011100111100011101011011111000000111100011000111010111101100011101010101001111010111010101011100010001110101101001111000010101111110111101110011110001110101101111100000011110001100011101011110110001110101010100111101001000010 eab88eb4f0afdee78eb7c0f18ebd8eaa7aeab88eb4f0afdee78eb7c0f18ebd8eaa7a42
UTF-8 蠍エ隴溷キ撰スェz蠍エ隴溷キ撰スェzB 111010001010000010001101111011111011110110110100111010011001101010110100111001101011101010110111111011111011110110110111111001101001001010110000111011111011110110111101111011111011110110101010011110101110100010100000100011011110111110111101101101001110100110011010101101001110011010111010101101111110111110111101101101111110011010010010101100001110111110111101101111011110111110111101101010100111101001000010 e8a08defbdb4e99ab4e6bab7efbdb7e692b0efbdbdefbdaa7ae8a08defbdb4e99ab4e6bab7efbdb7e692b0efbdbdefbdaa7a42
UHC ?????撰??z?????撰??zB 001111110011111100111111001111110011111111110011101111000011111100111111011110100011111100111111001111110011111100111111111100111011110000111111001111110111101001000010 3f3f3f3f3ff3bc3f3f7a3f3f3f3f3ff3bc3f3f7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)