To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 薰ェ螳、魘抵セ暦セ樶が螳、魘抵セ暦セ弯 11111011100111101010101011100101101011101010010011101001101101001001001011101111101111101001011111101111101111101001111011101110100000101010101011100101101011101010010011101001101101001001001011101111101111101001011111101111101111101001110001011110 fb9eaae5aea4e9b492efbe97efbe9eee82aae5aea4e9b492efbe97efbe9c5e
EUC-JP ?ェ螳、魘抵セ暦セ樶が螳、魘抵セ暦セ弯 00111111100011101010101011101010101100001000111010100100111100101011011011000100111100011000111010111110110011101111000110001110101111101101110011110000101001001010110011101010101100001000111010100100111100101011011011000100111100011000111010111110110011101111000110001110101111101101011110111111 3f8eaaeab08ea4f2b6c4f18ebecef18ebedcf0a4aceab08ea4f2b6c4f18ebecef18ebed7bf
UTF-8 薰ェ螳、魘抵セ暦セ樶が螳、魘抵セ暦セ弯 111010001001011010110000111011111011110110101010111010001001111010110011111011111011110110100100111010011010110110011000111001101000101010110101111011111011110110111110111001101001101010100110111011111011110110111110111001101010100010110110111000111000000110001100111010001001111010110011111011111011110110100100111010011010110110011000111001101000101010110101111011111011110110111110111001101001101010100110111011111011110110111110111001011011110010101111 e896b0efbdaae89eb3efbda4e9ad98e68ab5efbdbee69aa6efbdbee6a8b6e3818ce89eb3efbda4e9ad98e68ab5efbdbee69aa6efbdbee5bcaf
UHC 薰?螳??抵????が螳??抵???? 11111101101110010011111111010011110110010011111100111111111011101011110100111111001111110011111100111111101010101010110011010011110110010011111100111111111011101011110100111111001111110011111100111111 fdb93fd3d93f3feebd3f3f3f3faaacd3d93f3feebd3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)