To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 瓏苳苳W^瓏苳苳\}v瓏苳苳W^瓏苳苳\}vB 1110000011111010111001001001010011100100100101000101011101011110111000001111101011100100100101001110010010010100010111000111110101110110111000001111101011100100100101001110010010010100010101110101111011100000111110101110010010010100111001001001010001011100011111010111011001000010 e0fae494e494575ee0fae494e4945c7d76e0fae494e494575ee0fae494e4945c7d7642
EUC-JP 瓏苳苳W^瓏苳苳\}v瓏苳苳W^瓏苳苳\}vB 1110000011111100111001111111010011100111111101000101011101011110111000001111110011100111111101001110011111110100010111000111110101110110111000001111110011100111111101001110011111110100010101110101111011100000111111001110011111110100111001111111010001011100011111010111011001000010 e0fce7f4e7f4575ee0fce7f4e7f45c7d76e0fce7f4e7f4575ee0fce7f4e7f45c7d7642
UTF-8 瓏苳苳W^瓏苳苳\}v瓏苳苳W^瓏苳苳\}vB 1110011110010011100011111110100010001011101100111110100010001011101100110101011101011110111001111001001110001111111010001000101110110011111010001000101110110011010111000111110101110110111001111001001110001111111010001000101110110011111010001000101110110011010101110101111011100111100100111000111111101000100010111011001111101000100010111011001101011100011111010111011001000010 e7938fe88bb3e88bb3575ee7938fe88bb3e88bb35c7d76e7938fe88bb3e88bb3575ee7938fe88bb3e88bb35c7d7642
UHC 瓏??W^瓏??\}v瓏??W^瓏??\}vB 110101101110101000111111001111110101011101011110110101101110101000111111001111110101110001111101011101101101011011101010001111110011111101010111010111101101011011101010001111110011111101011100011111010111011001000010 d6ea3f3f575ed6ea3f3f5c7d76d6ea3f3f575ed6ea3f3f5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)