To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nkf???nk^}Y???nkf???nk^}bE 0011111100111111001111110110111001101011011001100011111100111111001111110110111001101011010111100111110101011001001111110011111100111111011011100110101101100110001111110011111100111111011011100110101101011110011111010110001001000101 3f3f3f6e6b663f3f3f6e6b5e7d593f3f3f6e6b663f3f3f6e6b5e7d6245
SJIS-WIN 薑〕箕nkf薑〕箕nk^}Y薑〕箕nkf薑〕箕nk^}bE 1110010101000111100000010110110010010110101001010110111001101011011001101110010101000111100000010110110010010110101001010110111001101011010111100111110101011001111001010100011110000001011011001001011010100101011011100110101101100110111001010100011110000001011011001001011010100101011011100110101101011110011111010110001001000101 e547816c96a56e6b66e547816c96a56e6b5e7d59e547816c96a56e6b66e547816c96a56e6b5e7d6245
EUC-JP 薑〕箕nkf薑〕箕nk^}Y薑〕箕nkf薑〕箕nk^}bE 1110100110101000101000011100110111001100101001110110111001101011011001101110100110101000101000011100110111001100101001110110111001101011010111100111110101011001111010011010100010100001110011011100110010100111011011100110101101100110111010011010100010100001110011011100110010100111011011100110101101011110011111010110001001000101 e9a8a1cdcca76e6b66e9a8a1cdcca76e6b5e7d59e9a8a1cdcca76e6b66e9a8a1cdcca76e6b5e7d6245
UTF-8 薑〕箕nkf薑〕箕nk^}Y薑〕箕nkf薑〕箕nk^}bE 1110100010010110100100011110001110000000100101011110011110101110100101010110111001101011011001101110100010010110100100011110001110000000100101011110011110101110100101010110111001101011010111100111110101011001111010001001011010010001111000111000000010010101111001111010111010010101011011100110101101100110111010001001011010010001111000111000000010010101111001111010111010010101011011100110101101011110011111010110001001000101 e89691e38095e7ae956e6b66e89691e38095e7ae956e6b5e7d59e89691e38095e7ae956e6b66e89691e38095e7ae956e6b5e7d6245
UHC 薑〕箕nkf薑〕箕nk^}Y薑〕箕nkf薑〕箕nk^}bE 1100101110111001101000011011001111010001101110010110111001101011011001101100101110111001101000011011001111010001101110010110111001101011010111100111110101011001110010111011100110100001101100111101000110111001011011100110101101100110110010111011100110100001101100111101000110111001011011100110101101011110011111010110001001000101 cbb9a1b3d1b96e6b66cbb9a1b3d1b96e6b5e7d59cbb9a1b3d1b96e6b66cbb9a1b3d1b96e6b5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)