To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 昭?沼W^昭?沼\}v昭?沼W^昭?沼\}vB 10001111101110100011111110001111110000000101011101011110100011111011101000111111100011111100000001011100011111010111011010001111101110100011111110001111110000000101011101011110100011111011101000111111100011111100000001011100011111010111011001000010 8fba3f8fc0575e8fba3f8fc05c7d768fba3f8fc0575e8fba3f8fc05c7d7642
EUC-JP 昭?沼W^昭?沼\}v昭?沼W^昭?沼\}vB 10111110101111000011111110111110110000100101011101011110101111101011110000111111101111101100001001011100011111010111011010111110101111000011111110111110110000100101011101011110101111101011110000111111101111101100001001011100011111010111011001000010 bebc3fbec2575ebebc3fbec25c7d76bebc3fbec2575ebebc3fbec25c7d7642
UTF-8 昭굩沼W^昭굩沼\}v昭굩沼W^昭굩沼\}vB 1110011010011000101011011110101010110101101010011110011010110010101111000101011101011110111001101001100010101101111010101011010110101001111001101011001010111100010111000111110101110110111001101001100010101101111010101011010110101001111001101011001010111100010101110101111011100110100110001010110111101010101101011010100111100110101100101011110001011100011111010111011001000010 e698adeab5a9e6b2bc575ee698adeab5a9e6b2bc5c7d76e698adeab5a9e6b2bc575ee698adeab5a9e6b2bc5c7d7642
UHC 昭굩沼W^昭굩沼\}v昭굩沼W^昭굩沼\}vB 1110000110111001100000101000111111100001101110110101011101011110111000011011100110000010100011111110000110111011010111000111110101110110111000011011100110000010100011111110000110111011010101110101111011100001101110011000001010001111111000011011101101011100011111010111011001000010 e1b9828fe1bb575ee1b9828fe1bb5c7d76e1b9828fe1bb575ee1b9828fe1bb5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)