To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蜊伜・ェ蜊帝ア亥叉蟆雁腰螂ェ蜊帝ア亥叉謠タ 11100101100011011001100011100101101001011010101011100101100011011001001011101001101100011000100011100101100011011011001111100101101100001000101011100101100011011001100011100101101001011010101011100101100011011001001011101001101100011000100011100101100011011011001111100110100011111000001101011110 e58d98e5a5aae58d92e9b188e58db3e5b08ae58d98e5a5aae58d92e9b188e58db3e68f835e
EUC-JP 蜊伜・ェ蜊帝ア亥叉蟆雁腰螂ェ蜊帝ア亥叉謠タ 111010011110110111010000111001111000111010100101100011101010101011101001111011011100010011101011100011101011000110110000111001111011101010110101111010101011001010110100111001111011100111111000111010101010011110001110101010101110100111101101110001001110101110001110101100011011000011100111101110101011010111101011111011111010010110111111 e9edd0e78ea58eaae9edc4eb8eb1b0e7bab5eab2b4e7b9f8eaa78eaae9edc4eb8eb1b0e7bab5ebefa5bf
UTF-8 蜊伜・ェ蜊帝ア亥叉蟆雁腰螂ェ蜊帝ア亥叉謠タ 111010001001110010001010111001001011110010011100111011111011110110100101111011111011110110101010111010001001110010001010111001011011100010011101111011111011110110110001111001001011101010100101111001011000111110001001111010001001111110000110111010011001101110000001111010001000010110110000111010001001111010000010111011111011110110101010111010001001110010001010111001011011100010011101111011111011110110110001111001001011101010100101111001011000111110001001111010001010110010100000111000111000001010111111 e89c8ae4bc9cefbda5efbdaae89c8ae5b89defbdb1e4baa5e58f89e89f86e99b81e885b0e89e82efbdaae89c8ae5b89defbdb1e4baa5e58f89e8aca0e382bf
UHC ?????帝?亥叉?雁腰螂??帝?亥叉謠タ 0011111100111111001111110011111100111111111100001010100000111111111110101010010011110011101010010011111111100100110100101110100110100110110101011100110000111111001111111111000010101000001111111111101010100100111100111010100111101001101010101010101110111111 3f3f3f3f3ff0a83ffaa4f3a93fe4d2e9a6d5cc3f3ff0a83ffaa4f3a9e9aaabbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)