To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 遶ェ莉門夋隱ー鞜懈錐遶ェ螟壻ソ苓ェー鞜懈拷 111001111010101110101010111001001011101110010110111001011111101010011111111010001010101010110000111010001101111110011100111001101001000010001101111001111010101110101010111001011010010010011010111001001011111110010111111010001010101010110000111010001101111110011100111001101000110110001001 e7abaae4bb96e5fa9fe8aab0e8df9ce6908de7abaae5a49ae4bf97e8aab0e8df9ce68d89
EUC-JP 遶ェ莉門夋隱ー鞜懈錐遶ェ螟壻ソ苓ェー鞜懈拷 11101110101011011000111010101010111010001011110111001100111001111000111110111000111000011111000010101100100011101011000011110000111000011101100011101000101111111110110111101110101011011000111010101010111010101010011011010100111001101000111010111111110011101110101010001110101010101000111010110000111100001110000111011000111010001011100111101001 eead8eaae8bdcce78fb8e1f0ac8eb0f0e1d8e8bfedeead8eaaeaa6d4e68ebfceea8eaa8eb0f0e1d8e8b9e9
UTF-8 遶ェ莉門夋隱ー鞜懈錐遶ェ螟壻ソ苓ェー鞜懈拷 111010011000000110110110111011111011110110101010111010001000111010001001111010011001011010000000111001011010010010001011111010011001101010110001111011111011110110110000111010011001111010011100111001101000011110001000111010011000110010010000111010011000000110110110111011111011110110101010111010001001111010011111111001011010001110111011111011111011110110111111111010001000101110010011111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110001000111001101000101110110111 e981b6efbdaae88e89e99680e5a48be99ab1efbdb0e99e9ce68788e98c90e981b6efbdaae89e9fe5a3bbefbdbfe88b93efbdaaefbdb0e99e9ce68788e68bb7
UHC ??莉門?隱??懈錐??螟壻?????懈拷 001111110011111111010111111010011101101010100110001111111110101111011111001111110011111111111010101010111111010111011110001111110011111111011001101011011101111111101011001111110011111100111111001111110011111111111010101010111100110110111000 3f3fd7e9daa63febdf3f3ffaabf5de3f3fd9addfeb3f3f3f3f3ffaabcdb8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)