To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蠍エ隶悟カク譎ィ荳ソ蟾仙符荵ょカク譎ィ荳ソ 111001011011011010110100111010001010111010001100111001011011011010111000111001101001100110101000111001001011100010111111111001011011011110010000111001011001010110000100111001001011100110000010111001011011011010111000111001101001100110101000111001001011100010111111 e5b6b4e8ae8ce5b6b8e699a8e4b8bfe5b790e59584e4b982e5b6b8e699a8e4b8bf
EUC-JP 蠍エ隶悟カク譎ィ荳ソ蟾仙符荵ょカク譎ィ荳ソ 111010101011100010001110101101001111000010110000101110001110011110001110101101101000111010111000111010111111100110001110101010001110100010111010100011101011111111101010101110011100000011100111110010011110010011101000101110111010010011100111100011101011011010001110101110001110101111111001100011101010100011101000101110101000111010111111 eab88eb4f0b0b8e78eb68eb8ebf98ea8e8ba8ebfeab9c0e7c9e4e8bba4e78eb68eb8ebf98ea8e8ba8ebf
UTF-8 蠍エ隶悟カク譎ィ荳ソ蟾仙符荵ょカク譎ィ荳ソ 111010001010000010001101111011111011110110110100111010011001101010110110111001101000001010011111111011111011110110110110111011111011110110111000111010001010110110001110111011111011110110101000111010001000110110110011111011111011110110111111111010001001111110111110111001001011101110011001111001111010110010100110111010001000110110110101111000111000001010000111111011111011110110110110111011111011110110111000111010001010110110001110111011111011110110101000111010001000110110110011111011111011110110111111 e8a08defbdb4e99ab6e6829fefbdb6efbdb8e8ad8eefbda8e88db3efbdbfe89fbee4bb99e7aca6e88db5e38287efbdb6efbdb8e8ad8eefbda8e88db3efbdbf
UHC ???悟??譎?荳?蟾仙符?ょ??譎?荳? 001111110011111100111111111001111111011000111111001111111111110111010010001111111101010011100101001111111110000011101010111000001011100111011101101011000011111110101010111001110011111100111111111111011101001000111111110101001110010100111111 3f3f3fe7f63f3ffdd23fd4e53fe0eae0b9ddac3faae73f3ffdd23fd4e53f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)