To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???厭?????橈?????厭?????橈??B 0011111100111111001111111000100101111101001111110011111100111111001111110011111110011110111101000011111100111111001111110011111100111111100010010111110100111111001111110011111100111111001111111001111011110100001111110011111101000010 3f3f3f897d3f3f3f3f3f9ef43f3f3f3f3f897d3f3f3f3f3f9ef43f3f42
EUC-JP 倻??厭??倻??橈??倻??厭??倻??橈??B 10001111101100011111011000111111001111111011000111011110001111110011111110001111101100011111011000111111001111111101110011110110001111110011111110001111101100011111011000111111001111111011000111011110001111110011111110001111101100011111011000111111001111111101110011110110001111110011111101000010 8fb1f63f3fb1de3f3f8fb1f63f3fdcf63f3f8fb1f63f3fb1de3f3f8fb1f63f3fdcf63f3f42
UTF-8 倻뽯젪厭묐젒倻뽯젪橈쎈젪倻뽯젪厭묐젒倻뽯젪橈쎈젪B 11100101100000001011101111101011101111011010111111101100101000001010101011100101100011101010110111101011101011001001000011101100101000001001001011100101100000001011101111101011101111011010111111101100101000001010101011100110101010011000100011101100100011101000100011101100101000001010101011100101100000001011101111101011101111011010111111101100101000001010101011100101100011101010110111101011101011001001000011101100101000001001001011100101100000001011101111101011101111011010111111101100101000001010101011100110101010011000100011101100100011101000100011101100101000001010101001000010 e580bbebbdafeca0aae58eadebac90eca092e580bbebbdafeca0aae6a988ec8e88eca0aae580bbebbdafeca0aae58eadebac90eca092e580bbebbdafeca0aae6a988ec8e88eca0aa42
UHC 倻뽯젪厭묐젒倻뽯젪橈쎈젪倻뽯젪厭묐젒倻뽯젪橈쎈젪B 11100101101001101001011011101011101000001010001011100110111101001001000111101011101000001001000111100101101001101001011011101011101000001010001011101000111110101011110111101011101000001010001011100101101001101001011011101011101000001010001011100110111101001001000111101011101000001001000111100101101001101001011011101011101000001010001011101000111110101011110111101011101000001010001001000010 e5a696eba0a2e6f491eba091e5a696eba0a2e8fabdeba0a2e5a696eba0a2e6f491eba091e5a696eba0a2e8fabdeba0a242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)