Abstract:Static analyzers reason about the behaviors of programs without executing them and report issues when they violate pre-defined desirable properties. One of the key limitations of static analyzers is their tendency to produce inaccurate and incomplete analysis results, i.e., they often generate too many spurious warnings and miss important issues. To help enhance the reliability of a static analyzer, developers usually manually write tests involving input programs and the corresponding expected analysis results for the analyzers. Meanwhile, a static analyzer often includes example programs in its documentation to demonstrate the desirable properties and/or their violations. Our key insight is that we can reuse programs extracted either from the official test suite or documentation and apply semantic-preserving transformations to them to generate variants. We studied the quality of input programs from these two sources and found that most rules in static analyzers are covered by at least one input program, implying the potential of using these programs as the basis for test generation. We present Statfier, a heuristic-based automated testing approach for static analyzers that generates program variants via semantic-preserving transformations and detects inconsistencies between the original program and variants (indicate inaccurate analysis results in the static analyzer). To select variants that are more likely to reveal new bugs, Statfier uses two key heuristics: (1) analysis report guided location selection that uses program locations in the reports produced by static analyzers to perform transformations and (2) structure diversity driven variant selection that chooses variants with different program contexts and diverse types of transformations. Our experiments with five popular static analyzers show that Statfier can find 79 bugs in these analyzers, of which 46 have been confirmed.

Statfier: Automated Testing of Static Analyzers Via Semantic-Preserving Program Transformations

Enhancing Bug-Inducing Commit Identification: A Fine-Grained Semantic Analysis Approach

Validating Static Warnings via Testing Code Fragments

An Empirical Study of False Negatives and Positives of Static Code Analyzers From the Perspective of Historical Issues

Canalyze: a Static Bug-Finding Tool for C Programs.

Understanding and Detecting Annotation-Induced Faults of Static Analyzers

Find bugs in static bug finders

A Static Analyzer for Numerical Programs in C and Fortran

Finding and Understanding Defects in Static Analyzers by Constructing Automated Oracles

Improving The Precision Of Static Analysis: Symbolic Execution Based On Ggcc Abstract Syntax Tree

StaticFixer: From Static Analysis to Static Repair

Supporting secure programming in web applications through interactive static analysis.

An Approach to Detecting Bugs in Pattern-Based Bug Detectors

Static automated program repair for heap properties

Automated Program-Semantic Defect Repair and False-Positive Elimination without Side Effects

Static analysis research of software automatic test

Automatically Inspecting Thousands of Static Bug Warnings with Large Language Model: How Far Are We?

Self-adaptive static analysis

Program Sifting: Select Property-Related Functions for Language-Based Static Analysis

Towards Understanding Fixes of SonarQube Static Analysis Violations: A Large-Scale Empirical Study

VBSAC: a value-based static analyzer for C