To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鬮ョ蟇よ升霑エ髢、鬮ョ蟇よ升霑エ遐ソ^ 11101001101010111010111011100101101011111000001011100110100011111010000111101000101111111011010011101001100101101010010011101001101010111010111011100101101011111000001011100110100011111010000111101000101111111011010011100111101000001011111101011110 e9abaee5af82e68fa1e8bfb4e996a4e9abaee5af82e68fa1e8bfb4e7a0bf5e
EUC-JP 鬮ョ蟇よ升霑エ髢、鬮ョ蟇よ升霑エ遐ソ^ 11110010101011011000111010101110111010101011000110100100111010001011111010100011111100001100000110001110101101001111000111110110100011101010010011110010101011011000111010101110111010101011000110100100111010001011111010100011111100001100000110001110101101001110111010100010100011101011111101011110 f2ad8eaeeab1a4e8bea3f0c18eb4f1f68ea4f2ad8eaeeab1a4e8bea3f0c18eb4eea28ebf5e
UTF-8 鬮ョ蟇よ升霑エ髢、鬮ョ蟇よ升霑エ遐ソ^ 11101001101011001010111011101111101111011010111011101000100111111000011111100011100000101000100011100101100011011000011111101001100111001001000111101111101111011011010011101001101010111010001011101111101111011010010011101001101011001010111011101111101111011010111011101000100111111000011111100011100000101000100011100101100011011000011111101001100111001001000111101111101111011011010011101001100000011001000011101111101111011011111101011110 e9acaeefbdaee89f87e38288e58d87e99c91efbdb4e9aba2efbda4e9acaeefbdaee89f87e38288e58d87e99c91efbdb4e98190efbdbf5e
UHC ???よ升霑??????よ升霑?遐?^ 0011111100111111001111111010101011101000111000111010111011101111110001010011111100111111001111110011111100111111001111111010101011101000111000111010111011101111110001010011111111111001110001100011111101011110 3f3f3faae8e3aeefc53f3f3f3f3f3faae8e3aeefc53ff9c63f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)