To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????h???? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100000111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f
SJIS-WIN シト竺酌シト赦射シト軸柴シァhシト竺酌 101111001100010010001110101100011000111011011110101111001100010010001110110011011000111011001011101111001100010010001110101100101000111011000100101111001010011101101000101111001100010010001110101100011000111011011110 bcc48eb18edebcc48ecd8ecbbcc48eb28ec4bca768bcc48eb18ede
EUC-JP シト竺酌シト赦射シト軸柴シァhシト竺酌 10001110101111001000111011000100101111001011001110111100111000001000111010111100100011101100010010111100110011111011110011001101100011101011110010001110110001001011110010110100101111001100011010001110101111001000111010100111011010001000111010111100100011101100010010111100101100111011110011100000 8ebc8ec4bcb3bce08ebc8ec4bccfbccd8ebc8ec4bcb4bcc68ebc8ea7688ebc8ec4bcb3bce0
UTF-8 シト竺酌シト赦射シト軸柴シァhシト竺酌 11101111101111011011110011101111101111101000010011100111101010111011101011101001100001011000110011101111101111011011110011101111101111101000010011101000101101011010011011100101101100001000010011101111101111011011110011101111101111101000010011101000101110111011100011100110100111111011010011101111101111011011110011101111101111011010011101101000111011111011110110111100111011111011111010000100111001111010101110111010111010011000010110001100 efbdbcefbe84e7abbae9858cefbdbcefbe84e8b5a6e5b084efbdbcefbe84e8bbb8e69fb4efbdbcefbda768efbdbcefbe84e7abbae9858c
UHC ??竺酌??赦射??軸柴??h??竺酌 001111110011111111110101111001111110110111001100001111110011111111011110111101011101111011010010001111110011111111110101111011101110001111000011001111110011111101101000001111110011111111110101111001111110110111001100 3f3ff5e7edcc3f3fdef5ded23f3ff5eee3c33f3f683f3ff5e7edcc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)