What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????W}????????W{^ 001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 驕擾スュ隴鯉スヲW}驕擾スュ隴鯉スヲW{^ 1110100110000001100011111110111110111101101011011110100010101101100011001110111110111101101001100101011101111101111010011000000110001111111011111011110110101101111010001010110110001100111011111011110110100110010101110111101101011110 e9818fefbdade8ad8cefbda6577de9818fefbdade8ad8cefbda6577b5e
EUC-JP 驕擾スュ隴鯉スヲW}驕擾スュ隴鯉スヲW{^ 11110001111000011011111011110001100011101011110110001110101011011111000010101111101110001111000110001110101111011000111010100110010101110111110111110001111000011011111011110001100011101011110110001110101011011111000010101111101110001111000110001110101111011000111010100110010101110111101101011110 f1e1bef18ebd8eadf0afb8f18ebd8ea6577df1e1bef18ebd8eadf0afb8f18ebd8ea6577b5e
UTF-8 驕擾スュ隴鯉スヲW}驕擾スュ隴鯉スヲW{^ 1110100110101001100101011110011010010011101111101110111110111101101111011110111110111101101011011110100110011010101101001110100110101111100010011110111110111101101111011110111110111101101001100101011101111101111010011010100110010101111001101001001110111110111011111011110110111101111011111011110110101101111010011001101010110100111010011010111110001001111011111011110110111101111011111011110110100110010101110111101101011110 e9a995e693beefbdbdefbdade99ab4e9af89efbdbdefbda6577de9a995e693beefbdbdefbdade99ab4e9af89efbdbdefbda6577b5e
UHC 驕擾???鯉??W}驕擾???鯉??W{^ 110011101111011011101000111101100011111100111111001111111101011111101111001111110011111101010111011111011100111011110110111010001111011000111111001111110011111111010111111011110011111100111111010101110111101101011110 cef6e8f63f3f3fd7ef3f3f577dcef6e8f63f3f3fd7ef3f3f577b5e
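A conversion like the one tabulated above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the function names are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table shows for ISO-8859-1 and UHC.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # "W{^" is the ASCII tail shared by every row of the table above.
    for label, (bits, hex_string) in encode_table("W{^").items():
        print(label, bits, hex_string)
```

Because "W{^" is plain ASCII, all five charsets encode it to the same three bytes, which matches the common 577b5e suffix in every row of the table.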

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).