To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN □?螢ァ? 胥杷ァ?尸□?螢ァ? 胥杷ァ?尸^ 100000011010000000111111111001011010001110000011010000000011111110000001010000001110001111101111100101000110011010000011010000000011111110011011100110011000000110100000001111111110010110100011100000110100000000111111100000010100000011100011111011111001010001100110100000110100000000111111100110111001100101011110 81a03fe5a383403f8140e3ef946683403f9b9981a03fe5a383403f8140e3ef946683403f9b995e
EUC-JP □?螢ァ? 胥杷ァ?尸□?螢ァ? 胥杷ァ?尸^ 101000101010001000111111111010101010010110100101101000010011111110100001101000011110011011110001110001111100011110100101101000010011111111010101111110011010001010100010001111111110101010100101101001011010000100111111101000011010000111100110111100011100011111000111101001011010000100111111110101011111100101011110 a2a23feaa5a5a13fa1a1e6f1c7c7a5a13fd5f9a2a23feaa5a5a13fa1a1e6f1c7c7a5a13fd5f95e
UTF-8 □룫螢ァ룶 胥杷ァ룵尸□룫螢ァ룶 胥杷ァ룵尸^ 11100010100101101010000111101011101000111010101111101000100111101010001011100011100000101010000111101011101000111011011011100011100000001000000011101000100000111010010111100110100111011011011111100011100000101010000111101011101000111011010111100101101100001011100011100010100101101010000111101011101000111010101111101000100111101010001011100011100000101010000111101011101000111011011011100011100000001000000011101000100000111010010111100110100111011011011111100011100000101010000111101011101000111011010111100101101100001011100001011110 e296a1eba3abe89ea2e382a1eba3b6e38080e883a5e69db7e382a1eba3b5e5b0b8e296a1eba3abe89ea2e382a1eba3b6e38080e883a5e69db7e382a1eba3b5e5b0b85e
UHC □룫螢ァ룶 胥杷ァ룵尸□룫螢ァ룶 胥杷ァ룵尸^ 101000011110000010001111101000101111101110101011101010111010000110001111101010111010000110100001111000001010000111110111111011011010101110100001100011111010101011100011101110011010000111100000100011111010001011111011101010111010101110100001100011111010101110100001101000011110000010100001111101111110110110101011101000011000111110101010111000111011100101011110 a1e08fa2fbababa18faba1a1e0a1f7edaba18faae3b9a1e08fa2fbababa18faba1a1e0a1f7edaba18faae3b95e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
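
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_table` and the mapping of the table's charset labels to Python codec names are assumptions (SJIS-WIN as `cp932`, UHC as `cp949`); Python's codec tables may differ from this page's in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary bit string, hexadecimal string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("\u25a1^").items():
        print(label, bits, hexstr)
```

Calling `encode_table` with the two characters U+25A1 (white square) and `^` should reproduce the leading and trailing bytes of each row in the table, for example `e296a1` followed by `5e` for UTF-8.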