What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????????????????E | 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 | 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45 |
| SJIS-WIN | 陞ゑスェ隴夐。鯉ス「荵溘¥髮埼メ魑ゥE | 11101000100111101000001011101111101111011010101011101000101011011001101011101001101000011000110011101111101111011010001011100100101110011001111111100011100000011000111111101001100110111000110111101001100000111000000111101001101100111010100101000101 | e89e82efbdaae8ad9ae9a18cefbda2e4b99fe3818fe99b8de98381e9b3a945 |
| EUC-JP | 陞ゑスェ隴夐。鯉ス「荵溘¥髮埼メ魑ゥE | 11101111111111101010010011110001100011101011110110001110101010101111000010101111110101001110101110001110101000011011100011110001100011101011110110001110101000101110100010111011110111101110010110100001111011111111000111111011101110101110101110100101111000011111001010110101100011101010100101000101 | effea4f18ebd8eaaf0afd4eb8ea1b8f18ebd8ea2e8bbdee5a1eff1fbbaeba5e1f2b58ea945 |
| UTF-8 | 陞ゑスェ隴夐。鯉ス「荵溘¥髮埼メ魑ゥE | 11101001100110011001111011100011100000101001000111101111101111011011110111101111101111011010101011101001100110101011010011100101101001001001000011101111101111011010000111101001101011111000100111101111101111011011110111101111101111011010001011101000100011011011010111100110101110101001100011101111101111111010010111101001101010111010111011100101100111111011110011100011100000111010000111101001101011011001000111101111101111011010100101000101 | e9999ee38291efbdbdefbdaae99ab4e5a490efbda1e9af89efbdbdefbda2e88db5e6ba98efbfa5e9abaee59fbce383a1e9ad91efbda945 |
| UHC | 陞ゑ?????鯉????¥髮埼メ??E | 1110001110110011101010101111000100111111001111110011111100111111001111111101011111101111001111110011111100111111001111111010000111001101110110111010010111010000111100101010101111100001001111110011111101000101 | e3b3aaf13f3f3f3f3fd7ef3f3f3f3fa1cddba5d0f2abe13f3f45 |

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
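
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The function name `encode_table` is hypothetical, and the mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, `euc_jp` for EUC-JP) is an assumption; Python's mapping tables may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the results above.

```python
# Assumed label -> Python codec name mapping (not taken from the page itself).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("E"):
        print(label, bits, hexstr)
```

For the ASCII letter `E`, every charset listed here yields the single byte `45` (`01000101`), which is why each row above ends with `45`.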