What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 塋??宥??飮??嚥??諭??扱陰???ル?^ 10011010110010000011111100111111100101110100011100111111001111111001111101011010001111110011111110011010100010110011111100111111100101110100000000111111001111111000100010110101100010010100000100111111001111110011111110000011100010110011111101011110 9ac83f3f97473f3f9f5a3f3f9a8b3f3f97403f3f88b589413f3f3f838b3f5e
EUC-JP 塋??宥??飮??嚥??諭??扱陰???ル?^ 11010100110010100011111100111111110011011010100000111111001111111101110110111011001111110011111111010011111010110011111100111111110011011010000100111111001111111011000010110111101100011010001000111111001111110011111110100101111010110011111101011110 d4ca3f3fcda83f3fddbb3f3fd3eb3f3fcda13f3fb0b7b1a23f3f3fa5eb3f5e
UTF-8 塋졼끍宥얍냶飮뗮뮃嚥싰퍛諭얏럩扱陰쏁독戮ル쭅^ 11100101101000011000101111101100101000011011110011101011100000011000110111100101101011101010010111101100100101101000110111101011100000111011011011101001101000111010111011101011100101111010111011101011101011101000001111100101100110101010010111101100100010111011000011101101100011011001101111101000101010111010110111101100100101101000111111101011100111111010100111100110100010011011000111101001100110011011000011101100100011111000000111101011100011111000010111101111101001111001001011100011100000111010101111101100101011011000010101011110 e5a18beca1bceb818de5aea5ec968deb83b6e9a3aeeb97aeebae83e59aa5ec8bb0ed8d9be8abadec968feb9fa9e689b1e999b0ec8f81eb8f85efa792e383abecad855e
UHC 塋졼끍宥얍냶飮뗮뮃嚥싰퍛諭얏럩扱陰쏁독戮ル쭅^ 111001111010101110100000111000111000010110111110111010101110100110111110111001011000011010000110111010111110011010001011111011011001001010010010111001101011111110011010111010101011101110010010111010111011000110111110111001101000111010001100110100001110001011101011111001001001101111100111101101011011011011101011101111011010101111101011101001111000000101011110 e7aba0e385beeae9bee58686ebe68bed9292e6bf9aeabb92ebb1bee68e8cd0e2ebe49be7b5b6ebbdabeba7815e
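A conversion like the one tabulated above can be approximated with a short script. This is only a sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp`, `cp949` for UHC) are Python's closest equivalents and are assumptions, the helper name `encode_all` is hypothetical, and a few individual mappings may differ from the converter's.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent are replaced with "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        bits = "".join(format(byte, "08b") for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("ル^").items():
        print(label, bits, hex_string)
```

With `errors="replace"`, a character that a charset cannot represent is emitted as `?` (byte `3f`), which matches the runs of `3f` in the ISO-8859-1 row above.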

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).