To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 貔オ魘ォ閭晄イサ陬晁ア「魄暦スイ閭晄イサ陬 1110011010111110101101011110100110110100101010111110100010000011100111011110011010110010101110111110100010100011100111011110100010110001101000101110100110101110100101111110111110111101101100101110100010000011100111011110011010110010101110111110100010100011 e6beb5e9b4abe8839de6b2bbe8a39de8b1a2e9ae97efbdb2e8839de6b2bbe8a3
EUC-JP 貔オ魘ォ閭晄イサ陬晁ア「魄暦スイ閭晄イサ陬 111011001100000010001110101101011111001010110110100011101010101111101111111000111101101011101000100011101011001010001110101110111111000010100101110110101110101010001110101100011000111010100010111100101011000011001110111100011000111010111101100011101011001011101111111000111101101011101000100011101011001010001110101110111111000010100101 ecc08eb5f2b68eabefe3dae88eb28ebbf0a5daea8eb18ea2f2b0cef18ebd8eb2efe3dae88eb28ebbf0a5
UTF-8 貔オ魘ォ閭晄イサ陬晁ア「魄暦スイ閭晄イサ陬 111010001011001010010100111011111011110110110101111010011010110110011000111011111011110110101011111010011001011010101101111001101001100110000100111011111011110110110010111011111011110110111011111010011001100110101100111001101001100110000001111011111011110110110001111011111011110110100010111010011010110110000100111001101001101010100110111011111011110110111101111011111011110110110010111010011001011010101101111001101001100110000100111011111011110110110010111011111011110110111011111010011001100110101100 e8b294efbdb5e9ad98efbdabe996ade69984efbdb2efbdbbe999ace69981efbdb1efbda2e9ad84e69aa6efbdbdefbdb2e996ade69984efbdb2efbdbbe999ac
UHC ????閭晄???晁??魄???閭晄??? 001111110011111100111111001111111101010111101111111111001100110100111111001111110011111111110000110001010011111100111111110110111101111000111111001111110011111111010101111011111111110011001101001111110011111100111111 3f3f3f3fd5effccd3f3f3ff0c53f3fdbde3f3f3fd5effccd3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)