What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????±??????????^ 001111110011111100111111001111110011111100111111001111110011111100111111101100010011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3fb13f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 隲、蜀暦スゥ諤懈イ±隲、蜀暦スゥ諤懈イ+^ 111010001010101110100100111001011000011010010111111011111011110110101001111001101000000010011100111001101011001010000001011111011110100010101011101001001110010110000110100101111110111110111101101010011110011010000000100111001110011010110010100000010111101101011110 e8aba4e58697efbda9e6809ce6b2817de8aba4e58697efbda9e6809ce6b2817b5e
EUC-JP 隲、蜀暦スゥ諤懈イ±隲、蜀暦スゥ諤懈イ+^ 1111000010101101100011101010010011101001111001101100111011110001100011101011110110001110101010011110101111100000110110001110100010001110101100101010000111011110111100001010110110001110101001001110100111100110110011101111000110001110101111011000111010101001111010111110000011011000111010001000111010110010101000011101110001011110 f0ad8ea4e9e6cef18ebd8ea9ebe0d8e88eb2a1def0ad8ea4e9e6cef18ebd8ea9ebe0d8e88eb2a1dc5e
UTF-8 隲、蜀暦スゥ諤懈イ±隲、蜀暦スゥ諤懈イ+^ 111010011001101010110010111011111011110110100100111010001001110010000000111001101001101010100110111011111011110110111101111011111011110110101001111010001010101110100100111001101000011110001000111011111011110110110010110000101011000111101001100110101011001011101111101111011010010011101000100111001000000011100110100110101010011011101111101111011011110111101111101111011010100111101000101010111010010011100110100001111000100011101111101111011011001011101111101111001000101101011110 e99ab2efbda4e89c80e69aa6efbdbdefbda9e8aba4e68788efbdb2c2b1e99ab2efbda4e89c80e69aa6efbdbdefbda9e8aba4e68788efbdb2efbc8b5e
UHC ??蜀????懈?±??蜀????懈?+^ 001111110011111111110101101110010011111100111111001111110011111111111010101010110011111110100001101111100011111100111111111101011011100100111111001111110011111100111111111110101010101100111111101000111010101101011110 3f3ff5b93f3f3f3ffaab3fa1be3f3ff5b93f3f3f3ffaab3fa3ab5e

SJIS-Win, EUC-JP: Legacy character sets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
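
The table rows can be reproduced approximately with a short Python sketch. This is not the converter's actual implementation. The codec names in `CODECS` are an assumed mapping from the table's charset labels to Python's built-in codecs, and `encode_row` is a hypothetical helper. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# (SJIS-WIN -> cp932 follows the note above; UHC -> cp949 is an assumption.)
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec.

    Hypothetical helper: unencodable characters become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Each byte is written as exactly 8 binary digits, with leading zeros.
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


# Print one line per charset in the same column order as the table.
sample = "±^"
for label, codec in CODECS.items():
    binary, hexadecimal = encode_row(sample, codec)
    print(label, sample, binary, hexadecimal)
```

Calling `encode_row("±^", "utf-8")` gives the hexadecimal string `c2b15e`, which agrees with the `c2b1` and `5e` byte sequences for "±" and "^" in the UTF-8 row.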