To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嶢?莊?魏?麻嶢??◇嶢?莊?魏?麻嶢??●^ 1001101111010000001111111110010010110101001111111110100110110000001111111001011010000011100110111101000000111111001111111000000110011110100110111101000000111111111001001011010100111111111010011011000000111111100101101000001110011011110100000011111100111111100000011001110001011110 9bd03fe4b53fe9b03f96839bd03f3f819e9bd03fe4b53fe9b03f96839bd03f3f819c5e
EUC-JP 嶢?莊?魏?麻嶢?蔣◇嶢?莊?魏?麻嶢?蔣●^ 110101101101001000111111111010001011011100111111111100101011001000111111110010111110001111010110110100100011111110001111110110011011011010100001111111101101011011010010001111111110100010110111001111111111001010110010001111111100101111100011110101101101001000111111100011111101100110110110101000011111110001011110 d6d23fe8b73ff2b23fcbe3d6d23f8fd9b6a1fed6d23fe8b73ff2b23fcbe3d6d23f8fd9b6a1fc5e
UTF-8 嶢렖莊렰魏잰麻嶢렎蔣◇嶢렖莊렰魏잰麻嶢렎蔣●^ 11100101101101101010001011101011101000001001011011101000100011101000101011101011101000001011000011101001101011011000111111101100100111101011000011101001101110101011101111100101101101101010001011101011101000001000111011101000100101001010001111100010100101111000011111100101101101101010001011101011101000001001011011101000100011101000101011101011101000001011000011101001101011011000111111101100100111101011000011101001101110101011101111100101101101101010001011101011101000001000111011101000100101001010001111100010100101111000111101011110 e5b6a2eba096e88e8aeba0b0e9ad8fec9eb0e9babbe5b6a2eba08ee894a3e29787e5b6a2eba096e88e8aeba0b0e9ad8fec9eb0e9babbe5b6a2eba08ee894a3e2978f5e
UHC 嶢렖莊렰魏잰麻嶢렎蔣◇嶢렖莊렰魏잰麻嶢렎蔣●^ 111010001111001010001110101010111110110111110110100011101011110111101010111000001100000011101001110110001010101111101000111100101000111010100100111011011111100010100001110111101110100011110010100011101010101111101101111101101000111010111101111010101110000011000000111010011101100010101011111010001111001010001110101001001110110111111000101000011101110001011110 e8f28eabedf68ebdeae0c0e9d8abe8f28ea4edf8a1dee8f28eabedf68ebdeae0c0e9d8abe8f28ea4edf8a1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)