To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 蟆壌煤W^蟆壌煤\}v蟆壌煤W^蟆壌煤\}vB 1110010110110000100011111110101110010100100000010101011101011110111001011011000010001111111010111001010010000001010111000111110101110110111001011011000010001111111010111001010010000001010101110101111011100101101100001000111111101011100101001000000101011100011111010111011001000010 e5b08feb9481575ee5b08feb94815c7d76e5b08feb9481575ee5b08feb94815c7d7642
EUC-JP 蟆壌煤W^蟆壌煤\}v蟆壌煤W^蟆壌煤\}vB 1110101010110010101111101110110111000111111000010101011101011110111010101011001010111110111011011100011111100001010111000111110101110110111010101011001010111110111011011100011111100001010101110101111011101010101100101011111011101101110001111110000101011100011111010111011001000010 eab2beedc7e1575eeab2beedc7e15c7d76eab2beedc7e1575eeab2beedc7e15c7d7642
UTF-8 蟆壌煤W^蟆壌煤\}v蟆壌煤W^蟆壌煤\}vB 1110100010011111100001101110010110100011100011001110011110000101101001000101011101011110111010001001111110000110111001011010001110001100111001111000010110100100010111000111110101110110111010001001111110000110111001011010001110001100111001111000010110100100010101110101111011101000100111111000011011100101101000111000110011100111100001011010010001011100011111010111011001000010 e89f86e5a38ce785a4575ee89f86e5a38ce785a45c7d76e89f86e5a38ce785a4575ee89f86e5a38ce785a45c7d7642
UHC ??煤W^??煤\}v??煤W^??煤\}vB 001111110011111111011000111000000101011101011110001111110011111111011000111000000101110001111101011101100011111100111111110110001110000001010111010111100011111100111111110110001110000001011100011111010111011001000010 3f3fd8e0575e3f3fd8e05c7d763f3fd8e0575e3f3fd8e05c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)