What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??桀??厓た????桀??厓た?純ビ^ 00111111001111111001111001111011001111110011111111111010100011011000001010111101001111110011111100111111001111111001111001111011001111110011111111111010100011011000001010111101001111111000111110000011100000110111001001011110 3f3f9e7b3f3ffa8d82bd3f3f3f3f9e7b3f3ffa8d82bd3f8f8383725e
EUC-JP ??桀??厓た?珹??桀??厓た?純ビ^ 0011111100111111110110111101110000111111001111111000111110110100110001111010010010111111001111111000111111001011111110110011111100111111110110111101110000111111001111111000111110110100110001111010010010111111001111111011110111100011101001011101001101011110 3f3fdbdc3f3f8fb4c7a4bf3f8fcbfb3f3fdbdc3f3f8fb4c7a4bf3fbde3a5d35e
UTF-8 룴창桀페룶厓た룶珹룴창桀페룶厓た룶純ビ^ 11101011101000111011010011101100101100001011110111100110101000011000000011101101100011101001100011101011101000111011011011100101100011101001001111100011100000011001111111101011101000111011011011100111100011111011100111101011101000111011010011101100101100001011110111100110101000011000000011101101100011101001100011101011101000111011011011100101100011101001001111100011100000011001111111101011101000111011011011100111101101001001010011100011100000111001001101011110 eba3b4ecb0bde6a180ed8e98eba3b6e58e93e3819feba3b6e78fb9eba3b4ecb0bde6a180ed8e98eba3b6e58e93e3819feba3b6e7b494e383935e
UHC 룴창桀페룶厓た룶珹룴창桀페룶厓た룶純ビ^ 100011111010100111000011101000101100101111111010110001101110010010001111101010111110010011101101101010101011111110001111101010111110000011111011100011111010100111000011101000101100101111111010110001101110010010001111101010111110010011101101101010101011111110001111101010111110001011101101101010111101001101011110 8fa9c3a2cbfac6e48fabe4edaabf8fabe0fb8fa9c3a2cbfac6e48fabe4edaabf8fabe2edabd35e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
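
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the page's actual implementation: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and so are the helper names encode_row and encode_table. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the ISO-8859-1 row appears to show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("た^").items():
        print(name, bits, hex_string)
```

Results from Python's codecs may differ from the page's for some characters, since each implementation ships its own mapping tables.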