To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 醍??W^醍??\}v醍??W^醍??\}vB 100100011110011100111111001111110101011101011110100100011110011100111111001111110101110001111101011101101001000111100111001111110011111101010111010111101001000111100111001111110011111101011100011111010111011001000010 91e73f3f575e91e73f3f5c7d7691e73f3f575e91e73f3f5c7d7642
EUC-JP 醍?汶W^醍?汶\}v醍?汶W^醍?汶\}vB 1100001011101001001111111000111111000110111001010101011101011110110000101110100100111111100011111100011011100101010111000111110101110110110000101110100100111111100011111100011011100101010101110101111011000010111010010011111110001111110001101110010101011100011111010111011001000010 c2e93f8fc6e5575ec2e93f8fc6e55c7d76c2e93f8fc6e5575ec2e93f8fc6e55c7d7642
UTF-8 醍닺汶W^醍닺汶\}v醍닺汶W^醍닺汶\}vB 1110100110000110100011011110101110001011101110101110011010110001101101100101011101011110111010011000011010001101111010111000101110111010111001101011000110110110010111000111110101110110111010011000011010001101111010111000101110111010111001101011000110110110010101110101111011101001100001101000110111101011100010111011101011100110101100011011011001011100011111010111011001000010 e9868deb8bbae6b1b6575ee9868deb8bbae6b1b65c7d76e9868deb8bbae6b1b6575ee9868deb8bbae6b1b65c7d7642
UHC 醍닺汶W^醍닺汶\}v醍닺汶W^醍닺汶\}vB 1111000010110101101101001110100011011010101000010101011101011110111100001011010110110100111010001101101010100001010111000111110101110110111100001011010110110100111010001101101010100001010101110101111011110000101101011011010011101000110110101010000101011100011111010111011001000010 f0b5b4e8daa1575ef0b5b4e8daa15c7d76f0b5b4e8daa1575ef0b5b4e8daa15c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)