Abstract:Mutation-based fuzzing is a simple yet effective technique to discover bugs and security vulnerabilities in software. Given a set of well-formed initial seeds, mutation-based fuzzers continually generate interesting seeds by applying specific mutation strategy in order to maximize code coverage or the number of unique bugs explored at any point-in-time. However, existing fuzzers remain limited in the paths it could cover since it simply follows a uniform distribution to choose mutation operators. In this paper, we proposed a novel context-aware adaptive mutation scheme, namely CMFuzz, which utilizes a contextual bandit algorithm LinUCB to effectively choose optimal mutation operators for various seed files. To this end, CMFuzz dynamically extracts and encodes file characteristics, which allows mutation-based fuzzers to perform context-aware mutation. We apply this scheme on top of several state-of-the-art fuzzers, i.e., PTfuzz, AFL, and AFLFast, and implement CMFuzz-PT, CMFuzz-AFL, and CMFuzz-AFLFast, respectively. We conduct evaluation on 12 real-world open source applications and LAVA-M dataset against their counterparts. Extensive evaluations demonstrate that CMFuzz-based fuzzers achieve higher code coverage and find more crashes at a faster rate than their counterparts on most cases. Furthermore, we also utilize other mainstream bandit algorithms, e.g., Thompson Sample and epsilon-greedy, and implement Thompson-PT and Greedy-PT based on PTfuzz to examine the performance of proposed model. CMFuzz-PT significantly outperforms Thompson-PT especially in terms of unique crashes and paths, i.e., found 1.79× unique crashes and 1.29× unique paths on average. Compared to Greedy-PT, our approach still increases the amount of unique crashes and paths by 1.11× and 1.05×, respectively.

Seq2Seq-AFL: Fuzzing Via Sequence-to-sequence Model

FA-Fuzz: A Novel Scheduling Scheme Using Firefly Algorithm for Mutation-Based Fuzzing

AMSFuzz: an Adaptive Mutation Schedule for Fuzzing

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

SLF: fuzzing without valid seed inputs

Fuzzing with Quantitative and Adaptive Hot-Bytes Identification

AFLPro: Direction sensitive fuzzing

FairFuzz: a targeted mutation strategy for increasing greybox fuzz testing coverage

SAFL: increasing and accelerating testing coverage with symbolic execution and guided fuzzing.

CMFuzz: Context-Aware Adaptive Mutation for Fuzzers

A Guided Mutation Strategy for Smart Contract Fuzzing

Evolutionary Mutation-based Fuzzing as Monte Carlo Tree Search

LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

LearnAFL: Greybox Fuzzing with Knowledge Enhancement

MuFuzz: Sequence-Aware Mutation and Seed Mask Guidance for Blockchain Smart Contract Fuzzing

Not all bytes are equal: Neural byte sieve for fuzzing

MSFuzz: Augmenting Protocol Fuzzing with Message Syntax Comprehension Via Large Language Models

Fuzzing with Optimized Grammar-Aware Mutation Strategies

Improving Grey-Box Fuzzing by Modeling Program Behavior

Facilitating Parallel Fuzzing with mutually-exclusive Task Distribution

EcoFuzz: Adaptive Energy-Saving Greybox Fuzzing As a Variant of the Adversarial Multi-Armed Bandit