To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 歪?????揄l?永?????乙щ????? 10011000011000110011111100111111001111110011111100111111100111011000100110000010100011000011111110001001011010010011111100111111001111110011111100111111100010011011001110000100100010110011111100111111001111110011111100111111 98633f3f3f3f3f9d89828c3f89693f3f3f3f3f89b3848b3f3f3f3f3f
EUC-JP 歪?ł瑗??揄l?永?????乙щ?邕??馹 110011111100010000111111100011111010100111001000100011111100110011000000001111110011111111011001111010011010001111101100001111111011000111001010001111110011111100111111001111110011111110110010101101011010011111101011001111111000111111100001111011010011111100111111100011111110100110100001 cfc43f8fa9c88fccc03f3fd9e9a3ec3fb1ca3f3f3f3f3fb2b5a7eb3f8fe1ed3f3f8fe9a1
UTF-8 歪뺣ł瑗뉛쬃揄l컭永띕끇六쀤벧乙щ뙆邕잆렕馹 11100110101011011010101011101011101110101010001111000101100000101110011110010001100101111110101110001001100110111110110010101100100000111110011010001111100001001110111110111101100011001110110010111011101011011110011010110000101110001110101110011101100101011110101110000001100001111110111110100111100100011110110010000000101001001110101110110010101001111110010010111001100110011101000110001001111010111001100110000110111010011000001010010101111011001001111010000110111010111010000010010101111010011010011010111001 e6adaaebbaa3c582e79197eb899becac83e68f84efbd8cecbbade6b0b8eb9d95eb8187efa791ec80a4ebb2a7e4b999d189eb9986e98295ec9e86eba095e9a6b9
UHC 歪뺣ł瑗뉛쬃揄l컭永띕끇六쀤벧乙щ뙆邕잆렕馹 1110100011100000100101011110101110101001101010011110101010111100100001111110111110100110100110101110101011110001101000111110110010110000100100111110011110110101101101101110101110000101101110111110101110111011100101111110010010111010101001101110101111100000101011001110101110001100100011001110100010111011100111111110001110001110101010101110110011110001 e8e095eba9a9eabc87efa69aeaf1a3ecb093e7b5b6eb85bbebbb97e4baa6ebe0aceb8c8ce8bb9fe38eaaecf1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)