To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 繹??曜??昻??W}繹??曜??昻??W{^ 1110001110001000001111110011111110010111011010100011111100111111111110101101000000111111001111110101011101111101111000111000100000111111001111111001011101101010001111110011111111111010110100000011111100111111010101110111101101011110 e3883f3f976a3f3ffad03f3f577de3883f3f976a3f3ffad03f3f577b5e
EUC-JP 繹??曜?????W}繹??曜?????W{^ 111001011110100000111111001111111100110111001011001111110011111100111111001111110011111101010111011111011110010111101000001111110011111111001101110010110011111100111111001111110011111100111111010101110111101101011110 e5e83f3fcdcb3f3f3f3f3f577de5e83f3fcdcb3f3f3f3f3f577b5e
UTF-8 繹먮젾曜쒕젡昻뽯젪W}繹먮젾曜쒕젡昻뽯젪W{^ 1110011110111001101110011110101110101000101011101110110010100000101111101110011010011011100111001110110010010010100101011110110010100000101000011110011010011000101110111110101110111101101011111110110010100000101010100101011101111101111001111011100110111001111010111010100010101110111011001010000010111110111001101001101110011100111011001001001010010101111011001010000010100001111001101001100010111011111010111011110110101111111011001010000010101010010101110111101101011110 e7b9b9eba8aeeca0bee69b9cec9295eca0a1e698bbebbdafeca0aa577de7b9b9eba8aeeca0bee69b9cec9295eca0a1e698bbebbdafeca0aa577b5e
UHC 繹먮젾曜쒕젡昻뽯젪W}繹먮젾曜쒕젡昻뽯젪W{^ 1110011010111010100100001110101110100000101100001110100011111000100111001110101110100000100110101110010011101001100101101110101110100000101000100101011101111101111001101011101010010000111010111010000010110000111010001111100010011100111010111010000010011010111001001110100110010110111010111010000010100010010101110111101101011110 e6ba90eba0b0e8f89ceba09ae4e996eba0a2577de6ba90eba0b0e8f89ceba09ae4e996eba0a2577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)