What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鬘句叙諢戊ア「蟶晏ッー隍晢スア雎「閠ス 111010011010000110001011111001011000111110010110111001101000010010010101111010001011000110100010111001011011100010011101111001011010111110110000111010001010010010011101111011111011110110110001111010001011000110100010111010001000000010111101 e9a18be58f96e68495e8b1a2e5b89de5afb0e8a49defbdb1e8b1a2e880bd
EUC-JP 鬘句叙諢戊ア「蟶晏ッー隍晢スア雎「閠ス 1111001010100011101101101110011110111101111101101110101111100100110010101110101010001110101100011000111010100010111010101011101011011010111001111000111010101111100011101011000011110000101001101101101011110001100011101011110110001110101100011111000010110011100011101010001011101111111000001000111010111101 f2a3b6e7bdf6ebe4caea8eb18ea2eabadae78eaf8eb0f0a6daf18ebd8eb1f0b38ea2efe08ebd
UTF-8 鬘句叙諢戊ア「蟶晏ッー隍晢スア雎「閠ス 111010011010110010011000111001011000111110100101111001011000111110011001111010001010101110100010111001101000100010001010111011111011110110110001111011111011110110100010111010001001111110110110111001101001100110001111111011111011110110101111111011111011110110110000111010011001101010001101111001101001100110100010111011111011110110111101111011111011110110110001111010011001101110001110111011111011110110100010111010011001011010100000111011111011110110111101 e9ac98e58fa5e58f99e8aba2e6888aefbdb1efbda2e89fb6e6998fefbdafefbdb0e99a8de699a2efbdbdefbdb1e99b8eefbda2e996a0efbdbd
UHC ?句??戊???晏??隍???雎??? 001111111100111110100011001111110011111111011001111001100011111100111111001111111110010011001111001111110011111111111100110110110011111100111111001111111110111011010001001111110011111100111111 3fcfa33f3fd9e63f3f3fe4cf3f3ffcdb3f3f3feed13f3f3f

SJIS-Win, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
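
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("句").items():
        print(label, bits, hexa)
```

For the single character 句, this sketch gives e58fa5 in UTF-8, 8be5 in SJIS-WIN, and b6e7 in EUC-JP, which agrees with the bytes shown for the second character in the corresponding table rows.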