To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汁???雋孟???瓦????雋孟???瓦? 10001111011000000011111100111111001111111110100010110010100101101101000000111111001111110011111110001010101000100011111100111111001111110011111111101000101100101001011011010000001111110011111100111111100010101010001000111111 8f603f3f3fe8b296d03f3f3f8aa23f3f3f3fe8b296d03f3f3f8aa23f
EUC-JP 汁???雋孟???瓦????雋孟???瓦? 10111101110000010011111100111111001111111111000010110100110011001101001000111111001111110011111110110100101001000011111100111111001111110011111111110000101101001100110011010010001111110011111100111111101101001010010000111111 bdc13f3f3ff0b4ccd23f3f3fb4a43f3f3f3ff0b4ccd23f3f3fb4a43f
UTF-8 汁흗렓렜雋孟웃渽렜瓦얜웃渽렜雋孟웃渽렜瓦숲 111001101011000110000001111011011001110110010111111010111010000010010011111010111010000010011100111010011001101110001011111001011010110110011111111011001001101110000011111001101011100010111101111010111010000010011100111001111001001110100110111011001001011010011100111011001001101110000011111001101011100010111101111010111010000010011100111010011001101110001011111001011010110110011111111011001001101110000011111001101011100010111101111010111010000010011100111001111001001110100110111011001000100010110010 e6b181ed9d97eba093eba09ce99b8be5ad9fec9b83e6b8bdeba09ce793a6ec969cec9b83e6b8bdeba09ce99b8be5ad9fec9b83e6b8bdeba09ce793a6ec88b2
UHC 汁흗렓렜雋孟웃渽렜瓦얜웃渽렜雋孟웃渽렜瓦숲 111100011111000011001000111010011000111010101000100011101010111011110001111001101101100011101011101111111111010011101110101010101000111010101110111010001011111110111110111010111011111111110100111011101010101010001110101011101111000111100110110110001110101110111111111101001110111010101010100011101010111011101000101111111011110110100011 f1f0c8e98ea88eaef1e6d8ebbff4eeaa8eaee8bfbeebbff4eeaa8eaef1e6d8ebbff4eeaa8eaee8bfbda3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)