To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 壯??墻??謠??疫???ヮ?絶?ぜ節??B 100110101110000100111111001111111001101011010100001111110011111111100110100011110011111100111111100010010111010100111111001111110011111110000011100011100011111110010000111000100011111110000010101110101001000011011111001111110011111101000010 9ae13f3f9ad43f3fe68f3f3f89753f3f3f838e3f90e23f82ba90df3f3f42
EUC-JP 壯??墻??謠??疫???ヮ?絶?ぜ節??B 110101001110001100111111001111111101010011010110001111110011111111101011111011110011111100111111101100011101011000111111001111110011111110100101111011100011111111000000111001000011111110100100101111001100000011100001001111110011111101000010 d4e33f3fd4d63f3febef3f3fb1d63f3f3fa5ee3fc0e43fa4bcc0e13f3f42
UTF-8 壯백쎁墻쇽풛謠쇽슴疫운뀿樂ヮ겮絶욇ぜ節욘뮈B 11100101101000111010111111101011101100001011000111101100100011101000000111100101101000101011101111101100100001111011110111101101100100101001101111101000101011001010000011101100100001111011110111101100100010101011010011100111100101101010101111101100100110101011010011101011100000001011111111101111101001101011111111100011100000111010111011101010101100101010111011100111101101011011011011101100100110101000011111100011100000011001110011100111101011111000000011101100100110101001100011101011101011101000100001000010 e5a3afebb0b1ec8e81e5a2bbec87bded929be8aca0ec87bdec8ab4e796abec9ab4eb80bfefa6bfe383aeeab2aee7b5b6ec9a87e3819ce7af80ec9a98ebae8842
UHC 壯백쎁墻쇽풛謠쇽슴疫운뀿樂ヮ겮絶욇ぜ節욘뮈B 11101101111000001011100111101001100110111010101111101101110111111011110011101111101111101001111011101001101010101011110011101111101111011011111111100110101110011011111111101110100001011011010111101000111110011010101111101110100000011011110011101111101111101001111011101001101010101011110011101111101111011011111111100110101110011011111101000010 ede0b9e99babeddfbcefbe9ee9aabcefbdbfe6b9bfee85b5e8f9abee81bcefbe9ee9aabcefbdbfe6b9bf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)