Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 貉ソ貉ソ薰先ケソ霆笠貉ソ貉ソ薰先ケソ霆顎^ 111001101011100110111111111001101011100110111111111110111001111010010000111001101011100110111111111010001011101110001010011111011110011010111001101111111110011010111001101111111111101110011110100100001110011010111001101111111110100010111011100010100111101101011110 e6b9bfe6b9bffb9e90e6b9bfe8bb8a7de6b9bfe6b9bffb9e90e6b9bfe8bb8a7b5e
EUC-JP 貉ソ貉ソ?先ケソ霆笠貉ソ貉ソ?先ケソ霆顎^ 111011001011101110001110101111111110110010111011100011101011111100111111110000001110100010001110101110011000111010111111111100001011110110110011110111101110110010111011100011101011111111101100101110111000111010111111001111111100000011101000100011101011100110001110101111111111000010111101101100111101110001011110 ecbb8ebfecbb8ebf3fc0e88eb98ebff0bdb3deecbb8ebfecbb8ebf3fc0e88eb98ebff0bdb3dc5e
UTF-8 貉ソ貉ソ薰先ケソ霆笠貉ソ貉ソ薰先ケソ霆顎^ 11101000101100101000100111101111101111011011111111101000101100101000100111101111101111011011111111101000100101101011000011100101100001011000100011101111101111011011100111101111101111011011111111101001100111001000011011100111101011001010000011101000101100101000100111101111101111011011111111101000101100101000100111101111101111011011111111101000100101101011000011100101100001011000100011101111101111011011100111101111101111011011111111101001100111001000011011101001101000011000111001011110 e8b289efbdbfe8b289efbdbfe896b0e58588efbdb9efbdbfe99c86e7aca0e8b289efbdbfe8b289efbdbfe896b0e58588efbdb9efbdbfe99c86e9a18e5e
UHC ????薰先??霆笠????薰先??霆顎^ 0011111100111111001111110011111111111101101110011110000010111011001111110011111111101111111111011101100010100010001111110011111100111111001111111111110110111001111000001011101100111111001111111110111111111101111001001100100101011110 3f3f3f3ffdb9e0bb3f3feffdd8a23f3f3f3ffdb9e0bb3f3feffde4c95e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
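
The conversion in the table can be approximated with a short Python sketch. It is not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the ISO-8859-1 and UHC rows; other implementations may map individual characters differently.

```python
# Assumed correspondence between the table's labels and Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("^").items():
        print(label, binary, hexadecimal)
```

For a plain ASCII character such as `^`, every charset listed here yields the same single byte, `5e`, which is why each row of the table ends with it.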