To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 螳滄撃雋ュ骭ょョ毳螳滄撃雋ュ骭ょョ毬^ 111001011010111010011111111010011000110010000010111010001011001010101101111010011000110010000010111001011010111010011111011111011110010110101110100111111110100110001100100000101110100010110010101011011110100110001100100000101110010110101110100111110111101101011110 e5ae9fe98c82e8b2ade98c82e5ae9f7de5ae9fe98c82e8b2ade98c82e5ae9f7b5e
EUC-JP 螳滄撃雋ュ骭ょョ毳螳滄撃雋ュ骭ょョ毬^ 11101010101100001101111011101011101101111110001011110000101101001000111010101101111100011110110010100100111001111000111010101110110111011101111011101010101100001101111011101011101101111110001011110000101101001000111010101101111100011110110010100100111001111000111010101110110111011101110001011110 eab0deebb7e2f0b48eadf1eca4e78eaedddeeab0deebb7e2f0b48eadf1eca4e78eaedddc5e
UTF-8 螳滄撃雋ュ骭ょョ毳螳滄撃雋ュ骭ょョ毬^ 11101000100111101011001111100110101110111000010011100110100100101000001111101001100110111000101111101111101111011010110111101001101010101010110111100011100000101000011111101111101111011010111011100110101011111011001111101000100111101011001111100110101110111000010011100110100100101000001111101001100110111000101111101111101111011010110111101001101010101010110111100011100000101000011111101111101111011010111011100110101011111010110001011110 e89eb3e6bb84e69283e99b8befbdade9aaade38287efbdaee6afb3e89eb3e6bb84e69283e99b8befbdade9aaade38287efbdaee6afac5e
UHC 螳滄?雋??ょ??螳滄?雋??ょ?毬^ 11010011110110011111001111100111001111111111000111100110001111110011111110101010111001110011111100111111110100111101100111110011111001110011111111110001111001100011111100111111101010101110011100111111110011111011001101011110 d3d9f3e73ff1e63f3faae73f3fd3d9f3e73ff1e63f3faae73fcfb35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)