Abstract: Deep learning methods that operate directly on raw software binaries are increasingly used for efficient malware detection. Recent studies show that deep learning models can easily be fooled into making wrong decisions by subtle perturbations of their inputs, which has attracted a large body of work on adversarial attacks. However, most existing attack methods rely on manually crafted features (e.g., API calls) or assume a white-box setting, which makes them impractical in real-world scenarios. In this work, we propose a novel attack framework called GAPGAN, which generates adversarial payloads (padding bytes) with generative adversarial networks (GANs). To the best of our knowledge, it is the first work that performs end-to-end black-box attacks at the byte level against deep learning based malware binaries detection. In our attack framework, we map the discrete input malware binaries to a continuous space and feed them to the generator of GAPGAN to generate adversarial payloads. We append the payloads to the original binaries to craft adversarial samples while preserving their functionality. We propose a dynamic threshold to reduce the loss of payload effectiveness when the payloads are mapped from the continuous format back to the original discrete format. To balance the generator's attention between the payloads and the adversarial samples, we use an automatic weight tuning strategy. We train GAPGAN with both malicious and benign software. Once training is finished, the generator can produce an adversarial sample from the input malware alone in less than twenty milliseconds. We apply GAPGAN to attack the state-of-the-art detector MalConv and achieve a 100% attack success rate by appending payloads that amount to only 2.5% of the total length of the data used for detection. We also attack deep learning models with different structures under different defense methods. The experiments show that GAPGAN outperforms other state-of-the-art attack models in efficiency and effectiveness.
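
The data flow described in the abstract (map bytes to a continuous space, obtain a payload, map it back to bytes, append it to the unmodified binary) can be illustrated with a minimal sketch. It is not the authors' implementation: the scaling to [-1, 1], the function names, and the use of plain rounding in place of the dynamic threshold are all assumptions made for illustration, and the generator itself is omitted, so the payload values are supplied by the caller.

```python
from typing import List


def bytes_to_continuous(data: bytes) -> List[float]:
    """Map each byte (0..255) to a float in [-1.0, 1.0] (assumed scaling)."""
    return [b / 127.5 - 1.0 for b in data]


def continuous_to_bytes(values: List[float]) -> bytes:
    """Map floats back to bytes by clipping and rounding.

    Plain rounding is a naive stand-in for the dynamic threshold, whose
    details the abstract does not give.
    """
    out = bytearray()
    for v in values:
        clipped = max(-1.0, min(1.0, v))
        out.append(int(round((clipped + 1.0) * 127.5)))
    return bytes(out)


def payload_budget(detector_input_len: int, ratio: float = 0.025) -> int:
    """Number of padding bytes for a given ratio of the detector input length."""
    return round(detector_input_len * ratio)


def craft_adversarial(malware: bytes, payload_values: List[float]) -> bytes:
    """Append the discretized payload after the original bytes.

    The original bytes are left untouched, so only padding is added.
    """
    return malware + continuous_to_bytes(payload_values)
```

Because the payload is only appended, the original bytes of the binary are unchanged, which is the property the abstract relies on to preserve functionality. The `ratio` default mirrors the 2.5% figure quoted for MalConv; the detector input length is left as a parameter because the abstract does not state it.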

ChatGPT as an Attack Tool: Stealthy Textual Backdoor Attack via Blackbox Generative Model Trigger

BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models

Towards Efficient Data Free Blackbox Adversarial Attack

BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT

Black-Box Adversarial Attacks Against Deep Learning Based Malware Binaries Detection with GAN

Adversarial Attacks on Large Language Model-Based System and Mitigating Strategies: A Case Study on ChatGPT

From ChatGPT to ThreatGPT: Impact of Generative AI in Cybersecurity and Privacy

The Devil is in the GAN: Backdoor Attacks and Defenses in Deep Generative Models

PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models

Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Model

How Robust Is a Large Pre-trained Language Model for Code Generation? A Case on Attacking GPT2

Claim-Guided Textual Backdoor Attack for Practical Applications

Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models

Decoding the Threat Landscape: ChatGPT, FraudGPT, and WormGPT in Social Engineering Attacks

Goal-guided Generative Prompt Injection Attack on Large Language Models

Watch Out for Your Guidance on Generation! Exploring Conditional Backdoor Attacks against Large Language Models

Backdooring Bias into Text-to-Image Models

ChatGPT-Generated Code Assignment Detection Using Perplexity of Large Language Models (Student Abstract)

Exploiting Novel GPT-4 APIs

Instruction Backdoor Attacks Against Customized LLMs