What bit string does a character (or short string) encode to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 騾カ螟懶スュ荳サ蠍急騾カ螟懶スュ荳サ蠍宮^ 111010011000000010110110111001011010010010011100111011111011110110101101111001001011100010111011111001011011011010001011011111011110100110000000101101101110010110100100100111001110111110111101101011011110010010111000101110111110010110110110100010110111101101011110 e980b6e5a49cefbdade4b8bbe5b68b7de980b6e5a49cefbdade4b8bbe5b68b7b5e
EUC-JP 騾カ螟懶スュ荳サ蠍急騾カ螟懶スュ荳サ蠍宮^ 1111000111100000100011101011011011101010101001101101100011110001100011101011110110001110101011011110100010111010100011101011101111101010101110001011010111011110111100011110000010001110101101101110101010100110110110001111000110001110101111011000111010101101111010001011101010001110101110111110101010111000101101011101110001011110 f1e08eb6eaa6d8f18ebd8eade8ba8ebbeab8b5def1e08eb6eaa6d8f18ebd8eade8ba8ebbeab8b5dc5e
UTF-8 騾カ螟懶スュ荳サ蠍急騾カ螟懶スュ荳サ蠍宮^ 11101001101010001011111011101111101111011011011011101000100111101001111111100110100001111011011011101111101111011011110111101111101111011010110111101000100011011011001111101111101111011011101111101000101000001000110111100110100000001010010111101001101010001011111011101111101111011011011011101000100111101001111111100110100001111011011011101111101111011011110111101111101111011010110111101000100011011011001111101111101111011011101111101000101000001000110111100101101011101010111001011110 e9a8beefbdb6e89e9fe687b6efbdbdefbdade88db3efbdbbe8a08de680a5e9a8beefbdb6e89e9fe687b6efbdbdefbdade88db3efbdbbe8a08de5aeae5e
UHC ??螟懶??荳??急??螟懶??荳??宮^ 0011111100111111110110011010110111010100111110110011111100111111110101001110010100111111001111111101000011100001001111110011111111011001101011011101010011111011001111110011111111010100111001010011111100111111110011111110000001011110 3f3fd9add4fb3f3fd4e53f3fd0e13f3fd9add4fb3f3fd4e53f3fcfe05e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
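
A table like the one above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from the tool's in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row shown above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ^"):
        print(label, bits, hexstr)
```

For plain ASCII input such as "A", every charset listed here yields the same single byte, 0x41; the rows only diverge for characters outside ASCII.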