To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 逹帷沮霓「闡ャ﨑閤逹帷沮霓「闡ャ﨑砿^ 111001111001110110011011111001111001111110011100111010001011110110100010111010001001000110101100111110101011000110001101011111011110011110011101100110111110011110011111100111001110100010111101101000101110100010010001101011001111101010110001100011010111101101011110 e79d9be79f9ce8bda2e891acfab18d7de79d9be79f9ce8bda2e891acfab18d7b5e
EUC-JP 逹帷沮霓「闡ャ?閤逹帷沮霓「闡ャ?砿^ 1110110111111101110101101110100111011101111111001111000010111111100011101010001011101111111100011000111010101100001111111011100111011110111011011111110111010110111010011101110111111100111100001011111110001110101000101110111111110001100011101010110000111111101110011101110001011110 edfdd6e9ddfcf0bf8ea2eff18eac3fb9deedfdd6e9ddfcf0bf8ea2eff18eac3fb9dc5e
UTF-8 逹帷沮霓「闡ャ﨑閤逹帷沮霓「闡ャ﨑砿^ 11101001100000001011100111100101101110001011011111100110101100101010111011101001100111001001001111101111101111011010001011101001100101111010000111101111101111011010110011101111101010001001000111101001100101101010010011101001100000001011100111100101101110001011011111100110101100101010111011101001100111001001001111101111101111011010001011101001100101111010000111101111101111011010110011101111101010001001000111100111101000001011111101011110 e980b9e5b8b7e6b2aee99c93efbda2e997a1efbdacefa891e996a4e980b9e5b8b7e6b2aee99c93efbda2e997a1efbdacefa891e7a0bf5e
UHC ??沮霓?闡??閤??沮霓?闡???^ 0011111100111111111011101100000111100111111001110011111111110100110001010011111100111111111110011110111000111111001111111110111011000001111001111110011100111111111101001100010100111111001111110011111101011110 3f3feec1e7e73ff4c53f3ff9ee3f3feec1e7e73ff4c53f3f3f5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
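
The conversion shown in the table can be approximated offline with a short script. The following is only a minimal sketch, not the page's actual implementation: it assumes the table labels correspond to Python's codecs `iso-8859-1`, `cp932` (for SJIS-WIN), `euc_jp`, `utf-8`, and `cp949` (for UHC), and that unencodable characters are replaced with "?" (0x3f), as the 3f bytes in the table suggest. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, input, binary, hexadecimal.
    for label, (bits, hexstr) in convert("あ^").items():
        print(label, "あ^", bits, hexstr)
```

Because "^" is plain ASCII, every charset above encodes it as the single byte 5e, which is why each hexadecimal string in the table ends in 5e.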