To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 荳ケ陲門穀蟾ス莉匁據荳ケ陲門穀蟾ス莉匁據^ 11100100101110001011100111101000101000101001011011100101100011011001001011100101101101111011110111100100101110111001011011100110100111011001111111100100101110001011100111101000101000101001011011100101100011011001001011100101101101111011110111100100101110111001011011100110100111011001111101011110 e4b8b9e8a296e58d92e5b7bde4bb96e69d9fe4b8b9e8a296e58d92e5b7bde4bb96e69d9f5e
EUC-JP 荳ケ陲門穀蟾ス莉匁據荳ケ陲門穀蟾ス莉匁據^ 1110100010111010100011101011100111110000101001001100110011100111101110011111001011101010101110011000111010111101111010001011110111001100111010001101101010100001111010001011101010001110101110011111000010100100110011001110011110111001111100101110101010111001100011101011110111101000101111011100110011101000110110101010000101011110 e8ba8eb9f0a4cce7b9f2eab98ebde8bdcce8daa1e8ba8eb9f0a4cce7b9f2eab98ebde8bdcce8daa15e
UTF-8 荳ケ陲門穀蟾ス莉匁據荳ケ陲門穀蟾ス莉匁據^ 11101000100011011011001111101111101111011011100111101001100110011011001011101001100101101000000011100111101010011000000011101000100111111011111011101111101111011011110111101000100011101000100111100101100011001000000111100110100100111001101011101000100011011011001111101111101111011011100111101001100110011011001011101001100101101000000011100111101010011000000011101000100111111011111011101111101111011011110111101000100011101000100111100101100011001000000111100110100100111001101001011110 e88db3efbdb9e999b2e99680e7a980e89fbeefbdbde88e89e58c81e6939ae88db3efbdb9e999b2e99680e7a980e89fbeefbdbde88e89e58c81e6939a5e
UHC 荳??門穀蟾?莉?據荳??門穀蟾?莉?據^ 110101001110010100111111001111111101101010100110110011011101101011100000111010100011111111010111111010010011111111001011111000001101010011100101001111110011111111011010101001101100110111011010111000001110101000111111110101111110100100111111110010111110000001011110 d4e53f3fdaa6cddae0ea3fd7e93fcbe0d4e53f3fdaa6cddae0ea3fd7e93fcbe05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)