To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蠇幢スカ﨡会スス閠ウ蠇懶スイ﨡会スス闥膿 11111011101000011001101111101111101111011011011011111011101000001000100111101111101111011011110111101000100000001011001111111011101000011001110011101111101111011011001011111011101000001000100111101111101111011011110111101000100100101001010001011110 fba19befbdb6fba089efbdbde880b3fba19cefbdb2fba089efbdbde892945e
EUC-JP ?幢スカ?会スス閠ウ?懶スイ?会スス闥膿 001111111101011011110001100011101011110110001110101101100011111110110010111100011000111010111101100011101011110111101111111000001000111010110011001111111101100011110001100011101011110110001110101100100011111110110010111100011000111010111101100011101011110111101111111100101100011110111111 3fd6f18ebd8eb63fb2f18ebd8ebdefe08eb33fd8f18ebd8eb23fb2f18ebd8ebdeff2c7bf
UTF-8 蠇幢スカ﨡会スス閠ウ蠇懶スイ﨡会スス闥膿 111010001010000010000111111001011011100110100010111011111011110110111101111011111011110110110110111011111010100010100001111001001011110010011010111011111011110110111101111011111011110110111101111010011001011010100000111011111011110110110011111010001010000010000111111001101000011110110110111011111011110110111101111011111011110110110010111011111010100010100001111001001011110010011010111011111011110110111101111011111011110110111101111010011001011110100101111010001000011010111111 e8a087e5b9a2efbdbdefbdb6efa8a1e4bc9aefbdbdefbdbde996a0efbdb3e8a087e687b6efbdbdefbdb2efa8a1e4bc9aefbdbdefbdbde997a5e886bf
UHC ?幢?????????懶???????膿 0011111111010011110100110011111100111111001111110011111100111111001111110011111100111111001111111101010011111011001111110011111100111111001111110011111100111111001111111101001011011011 3fd3d33f3f3f3f3f3f3f3f3fd4fb3f3f3f3f3f3f3fd2db

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)