To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 辷イ貉ソ遽晁楳薰迎辷イ貉ソ遽晁楳薰鶏^ 111001111000100010110010111001101011100110111111111001111010111110011101111010001001010010000000111110111001111010001100011111011110011110001000101100101110011010111001101111111110011110101111100111011110100010010100100000001111101110011110100011000111101101011110 e788b2e6b9bfe7af9de89480fb9e8c7de788b2e6b9bfe7af9de89480fb9e8c7b5e
EUC-JP 辷イ貉ソ遽晁楳?迎辷イ貉ソ遽晁楳?鶏^ 1110110111101000100011101011001011101100101110111000111010111111111011101011000111011010111010101100011111100000001111111011011111011110111011011110100010001110101100101110110010111011100011101011111111101110101100011101101011101010110001111110000000111111101101111101110001011110 ede88eb2ecbb8ebfeeb1daeac7e03fb7deede88eb2ecbb8ebfeeb1daeac7e03fb7dc5e
UTF-8 辷イ貉ソ遽晁楳薰迎辷イ貉ソ遽晁楳薰鶏^ 11101000101111101011011111101111101111011011001011101000101100101000100111101111101111011011111111101001100000011011110111100110100110011000000111100110101001011011001111101000100101101011000011101000101111111000111011101000101111101011011111101111101111011011001011101000101100101000100111101111101111011011111111101001100000011011110111100110100110011000000111100110101001011011001111101000100101101011000011101001101101101000111101011110 e8beb7efbdb2e8b289efbdbfe981bde69981e6a5b3e896b0e8bf8ee8beb7efbdb2e8b289efbdbfe981bde69981e6a5b3e896b0e9b68f5e
UHC ????遽晁?薰迎????遽晁?薰?^ 0011111100111111001111110011111111001011111010001111000011000101001111111111110110111001111001111100101000111111001111110011111100111111110010111110100011110000110001010011111111111101101110010011111101011110 3f3f3f3fcbe8f0c53ffdb9e7ca3f3f3f3fcbe8f0c53ffdb93f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)