What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 沃??猷??宥??繹??異??碎??沃??B 1001011110000000001111110011111110010111010100010011111100111111100101110100011100111111001111111110001110001000001111110011111110001000110110010011111100111111111000011110101000111111001111111001011110000000001111110011111101000010 97803f3f97513f3f97473f3fe3883f3f88d93f3fe1ea3f3f97803f3f42
EUC-JP 沃??猷??宥??繹??異??碎??沃??B 1100110111100000001111110011111111001101101100100011111100111111110011011010100000111111001111111110010111101000001111110011111110110000110110110011111100111111111000101110110000111111001111111100110111100000001111110011111101000010 cde03f3fcdb23f3fcda83f3fe5e83f3fb0db3f3fe2ec3f3fcde03f3f42
UTF-8 沃ㅺ낯猷꾤뵳宥듬춣繹먮씮異룩첑碎ㅻ깹沃쇱썤B 11100110101100101000001111100011100001011011101011101011100000101010111111100111100011001011011111101010101111101010010011101011101101011011001111100101101011101010010111101011100100111010110011101100101101101010001111100111101110011011100111101011101010001010111011101100100101001010111011100111100101011011000011101011101000111010100111101100101100101001000111100111101000101000111011100011100001011011101111101010101110011011100111100110101100101000001111101100100001111011000111101100100011011010010001000010 e6b283e385baeb82afe78cb7eabea4ebb5b3e5aea5eb93acecb6a3e7b9b9eba8aeec94aee795b0eba3a9ecb291e7a28ee385bbeab9b9e6b283ec87b1ec8da442
UHC 沃ㅺ낯猷꾤뵳宥듬춣繹먮씮異룩첑碎ㅻ깹沃쇱썤B 11101000101010101010010011101010101100111011100011101011101000111000010011100111100101001011000111101010111010011011010111101011101011011000010011100110101110101001000011101011100111011011111111101100101101101011011111101000101010101001111011100001111011111010010011101011101100101010000111101000101010101011110011101100100110111001011101000010 e8aaa4eab3b8eba384e794b1eae9b5ebad84e6ba90eb9dbfecb6b7e8aa9ee1efa4ebb2a1e8aabcec9b9742

SJIS-win, EUC-JP: legacy character sets mainly used to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
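
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names (`cp932` for SJIS-win, `euc_jp`, `cp949` for UHC) are assumed to be the closest Python equivalents of the charsets listed, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which matches the runs of `3f` seen in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's equivalent of SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is Python's equivalent of UHC
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For the ASCII letter `B`, every charset yields the same single byte, which is why each row of the table ends in `42`. Multi-byte characters diverge: for example, `あ` becomes `82a0` in CP932, `a4a2` in EUC-JP, and `e38182` in UTF-8.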