What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????\}?????????\{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101110001111101001111110011111100111111001111110011111100111111001111110011111100111111010111000111101101011110 3f3f3f3f3f3f3f3f3f5c7d3f3f3f3f3f3f3f3f3f5c7b5e
SJIS-WIN 螳、霆頑昏霍矩ケソ\}螳、霆頑昏霍矩ケソ\{^ 1110010110101110101001001110100010111011100010101110011010001101101010001110100010110111100010111110100110111001101111110101110001111101111001011010111010100100111010001011101110001010111001101000110110101000111010001011011110001011111010011011100110111111010111000111101101011110 e5aea4e8bb8ae68da8e8b78be9b9bf5c7de5aea4e8bb8ae68da8e8b78be9b9bf5c7b5e
EUC-JP 螳、霆頑昏霍矩ケソ\}螳、霆頑昏霍矩ケソ\{^ 1110101010110000100011101010010011110000101111011011010011101000101110101010101011110000101110011011011011101011100011101011100110001110101111110101110001111101111010101011000010001110101001001111000010111101101101001110100010111010101010101111000010111001101101101110101110001110101110011000111010111111010111000111101101011110 eab08ea4f0bdb4e8baaaf0b9b6eb8eb98ebf5c7deab08ea4f0bdb4e8baaaf0b9b6eb8eb98ebf5c7b5e
UTF-8 螳、霆頑昏霍矩ケソ\}螳、霆頑昏霍矩ケソ\{^ 1110100010011110101100111110111110111101101001001110100110011100100001101110100110100000100100011110011010011000100011111110100110011100100011011110011110011111101010011110111110111101101110011110111110111101101111110101110001111101111010001001111010110011111011111011110110100100111010011001110010000110111010011010000010010001111001101001100010001111111010011001110010001101111001111001111110101001111011111011110110111001111011111011110110111111010111000111101101011110 e89eb3efbda4e99c86e9a091e6988fe99c8de79fa9efbdb9efbdbf5c7de89eb3efbda4e99c86e9a091e6988fe99c8de79fa9efbdb9efbdbf5c7b5e
UHC 螳?霆頑昏?矩??\}螳?霆頑昏?矩??\{^ 110100111101100100111111111011111111110111101000110101111111101111100111001111111100111110111011001111110011111101011100011111011101001111011001001111111110111111111101111010001101011111111011111001110011111111001111101110110011111100111111010111000111101101011110 d3d93feffde8d7fbe73fcfbb3f3f5c7dd3d93feffde8d7fbe73fcfbb3f3f5c7b5e

SJIS-Win, EUC-JP: Legacy character encodings for Japanese, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
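
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_table` is hypothetical, and the Python codec names are assumed stand-ins for the charset labels in the table (`cp932` for SJIS-WIN, `cp949` for UHC); they may differ from the page's converter in edge cases. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        result[label] = (bits, data.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("あ").items():
        print(label, bits, hexa)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.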