To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥〓∥誼??恂レ?嚥〓∥誼??恂レ?^ 10011010100010111000000110101100100000010110000110001011011000100011111100111111100111001001011010000011100011000011111110011010100010111000000110101100100000010110000110001011011000100011111100111111100111001001011010000011100011000011111101011110 9a8b81ac81618b623f3f9c96838c3f9a8b81ac81618b623f3f9c96838c3f5e
EUC-JP 嚥〓‖誼??恂レ?嚥〓‖誼??恂レ?^ 11010011111010111010001010101110101000011100001010110101110000110011111100111111110101111111011010100101111011000011111111010011111010111010001010101110101000011100001010110101110000110011111100111111110101111111011010100101111011000011111101011110 d3eba2aea1c2b5c33f3fd7f6a5ec3fd3eba2aea1c2b5c33f3fd7f6a5ec3f5e
UTF-8 嚥〓∥誼놂쭓恂レ쨧嚥〓∥誼놂쭓恂レ쨧^ 11100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101011100001101000001011101100101011011001001111100110100000011000001011100011100000111010110011101100101010001010011111100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101011100001101000001011101100101011011001001111100110100000011000001011100011100000111010110011101100101010001010011101011110 e59aa5e38093e288a5e8aabceb8682ecad93e68182e383aceca8a7e59aa5e38093e288a5e8aabceb8682ecad93e68182e383aceca8a75e
UHC 嚥〓∥誼놂쭓恂レ쨧嚥〓∥誼놂쭓恂レ쨧^ 11100110101111111010000111101011101000011010101111101011111111101011001111101111101001111000101111100010111000011010101111101100101001001000001011100110101111111010000111101011101000011010101111101011111111101011001111101111101001111000101111100010111000011010101111101100101001001000001001011110 e6bfa1eba1abebfeb3efa78be2e1abeca482e6bfa1eba1abebfeb3efa78be2e1abeca4825e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)