To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 貔オ魘ォ釿晄イサ陬晁ア「魄暦スイ釿晄イサ陬 1110011010111110101101011110100110110100101010111110011111100001100111011110011010110010101110111110100010100011100111011110100010110001101000101110100110101110100101111110111110111101101100101110011111100001100111011110011010110010101110111110100010100011 e6beb5e9b4abe7e19de6b2bbe8a39de8b1a2e9ae97efbdb2e7e19de6b2bbe8a3
EUC-JP 貔オ魘ォ釿晄イサ陬晁ア「魄暦スイ釿晄イサ陬 111011001100000010001110101101011111001010110110100011101010101111101110111000111101101011101000100011101011001010001110101110111111000010100101110110101110101010001110101100011000111010100010111100101011000011001110111100011000111010111101100011101011001011101110111000111101101011101000100011101011001010001110101110111111000010100101 ecc08eb5f2b68eabeee3dae88eb28ebbf0a5daea8eb18ea2f2b0cef18ebd8eb2eee3dae88eb28ebbf0a5
UTF-8 貔オ魘ォ釿晄イサ陬晁ア「魄暦スイ釿晄イサ陬 111010001011001010010100111011111011110110110101111010011010110110011000111011111011110110101011111010011000011110111111111001101001100110000100111011111011110110110010111011111011110110111011111010011001100110101100111001101001100110000001111011111011110110110001111011111011110110100010111010011010110110000100111001101001101010100110111011111011110110111101111011111011110110110010111010011000011110111111111001101001100110000100111011111011110110110010111011111011110110111011111010011001100110101100 e8b294efbdb5e9ad98efbdabe987bfe69984efbdb2efbdbbe999ace69981efbdb1efbda2e9ad84e69aa6efbdbdefbdb2e987bfe69984efbdb2efbdbbe999ac
UHC ?????晄???晁??魄????晄??? 00111111001111110011111100111111001111111111110011001101001111110011111100111111111100001100010100111111001111111101101111011110001111110011111100111111001111111111110011001101001111110011111100111111 3f3f3f3f3ffccd3f3f3ff0c53f3fdbde3f3f3f3ffccd3f3f3f
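
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_row` and `convert` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

With `errors="replace"`, a character that a charset cannot represent is encoded as `?` (byte `3f`), which corresponds to the runs of `3f` in the ISO-8859-1 and UHC rows above.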

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.