To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????U}????????U{^ 001111110011111100111111001111110011111100111111001111110011111101010101011111010011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 訒ュ迺ス訒イ雉ェU}訒ュ迺ス訒イ雉ェU{^ 1111101110100011101011011110011110010010101111011111101110100011101100101110100010110011101010100101010101111101111110111010001110101101111001111001001010111101111110111010001110110010111010001011001110101010010101010111101101011110 fba3ade792bdfba3b2e8b3aa557dfba3ade792bdfba3b2e8b3aa557b5e
EUC-JP 訒ュ迺ス訒イ雉ェU}訒ュ迺ス訒イ雉ェU{^ 1000111111011101110010001000111010101101111011011111001010001110101111011000111111011101110010001000111010110010111100001011010110001110101010100101010101111101100011111101110111001000100011101010110111101101111100101000111010111101100011111101110111001000100011101011001011110000101101011000111010101010010101010111101101011110 8fddc88eadedf28ebd8fddc88eb2f0b58eaa557d8fddc88eadedf28ebd8fddc88eb2f0b58eaa557b5e
UTF-8 訒ュ迺ス訒イ雉ェU}訒ュ迺ス訒イ雉ェU{^ 1110100010101000100100101110111110111101101011011110100010111111101110101110111110111101101111011110100010101000100100101110111110111101101100101110100110011011100010011110111110111101101010100101010101111101111010001010100010010010111011111011110110101101111010001011111110111010111011111011110110111101111010001010100010010010111011111011110110110010111010011001101110001001111011111011110110101010010101010111101101011110 e8a892efbdade8bfbaefbdbde8a892efbdb2e99b89efbdaa557de8a892efbdade8bfbaefbdbde8a892efbdb2e99b89efbdaa557b5e
UHC ??????雉?U}??????雉?U{^ 0011111100111111001111110011111100111111001111111111011011001011001111110101010101111101001111110011111100111111001111110011111100111111111101101100101100111111010101010111101101011110 3f3f3f3f3f3ff6cb3f557d3f3f3f3f3f3ff6cb3f557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)