To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 汚?瓦W^汚?瓦\}v汚?瓦W^汚?瓦\}vB 10001001100110000011111110001010101000100101011101011110100010011001100000111111100010101010001001011100011111010111011010001001100110000011111110001010101000100101011101011110100010011001100000111111100010101010001001011100011111010111011001000010 89983f8aa2575e89983f8aa25c7d7689983f8aa2575e89983f8aa25c7d7642
EUC-JP 汚?瓦W^汚?瓦\}v汚?瓦W^汚?瓦\}vB 10110001111110000011111110110100101001000101011101011110101100011111100000111111101101001010010001011100011111010111011010110001111110000011111110110100101001000101011101011110101100011111100000111111101101001010010001011100011111010111011001000010 b1f83fb4a4575eb1f83fb4a45c7d76b1f83fb4a4575eb1f83fb4a45c7d7642
UTF-8 汚씍瓦W^汚씍瓦\}v汚씍瓦W^汚씍瓦\}vB 1110011010110001100110101110110010010100100011011110011110010011101001100101011101011110111001101011000110011010111011001001010010001101111001111001001110100110010111000111110101110110111001101011000110011010111011001001010010001101111001111001001110100110010101110101111011100110101100011001101011101100100101001000110111100111100100111010011001011100011111010111011001000010 e6b19aec948de793a6575ee6b19aec948de793a65c7d76e6b19aec948de793a6575ee6b19aec948de793a65c7d7642
UHC 汚씍瓦W^汚씍瓦\}v汚씍瓦W^汚씍瓦\}vB 1110011111111101100111011010010011101000101111110101011101011110111001111111110110011101101001001110100010111111010111000111110101110110111001111111110110011101101001001110100010111111010101110101111011100111111111011001110110100100111010001011111101011100011111010111011001000010 e7fd9da4e8bf575ee7fd9da4e8bf5c7d76e7fd9da4e8bf575ee7fd9da4e8bf5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)