To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????翁脛??圓?????翁脛??圓?^ 001111110011111100111111001111111000100110100101111000111111100000111111001111111001101010100010001111110011111100111111001111110011111110001001101001011110001111111000001111110011111110011010101000100011111101011110 3f3f3f3f89a5e3f83f3f9aa23f3f3f3f3f89a5e3f83f3f9aa23f5e
EUC-JP 焌???翁脛??圓?焌???翁脛??圓?^ 10001111110010011110100000111111001111110011111110110010101001111110011011111010001111110011111111010100101001000011111110001111110010011110100000111111001111110011111110110010101001111110011011111010001111110011111111010100101001000011111101011110 8fc9e83f3f3fb2a7e6fa3f3fd4a43f8fc9e83f3f3fb2a7e6fa3f3fd4a43f5e
UTF-8 焌렱狀렡翁脛렖렖圓왼焌렱狀렡翁脛렖렖圓외^ 11100111100001001000110011101011101000001011000111101111101001111011101011101011101000001010000111100111101111111000000111101000100001001001101111101011101000001001011011101011101000001001011011100101100111001001001111101100100110011011110011100111100001001000110011101011101000001011000111101111101001111011101011101011101000001010000111100111101111111000000111101000100001001001101111101011101000001001011011101011101000001001011011100101100111001001001111101100100110011011100001011110 e7848ceba0b1efa7baeba0a1e7bf81e8849beba096eba096e59c93ec99bce7848ceba0b1efa7baeba0a1e7bf81e8849beba096eba096e59c93ec99b85e
UHC 焌렱狀렡翁脛렖렖圓왼焌렱狀렡翁脛렖렖圓외^ 1111000111100000100011101011111011101101111011101000111010110010111010001011101011001100111010111000111010101011100011101010101111101010101011011011111111011110111100011110000010001110101111101110110111101110100011101011001011101000101110101100110011101011100011101010101110001110101010111110101010101101101111111101110001011110 f1e08ebeedee8eb2e8bacceb8eab8eabeaadbfdef1e08ebeedee8eb2e8bacceb8eab8eabeaadbfdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)