Abstract:Due to the critical role of compilers, many compiler testing techniques have been proposed, two most notable categories among which are grammar-based and metamorphic-based techniques. All of them have been extensively studied for testing mature compilers. However, it is typical to develop a new compiler for a new-born programming language in practice. In this scenario, the existing techniques are hardly applicable due to some major reasons: (1) no reference compilers to support differential testing, (2) lack of program analysis tools to support most of metamorphic-based compiler testing, (3) substantial implementation effort incurred by different programming language features. Hence, it is unknown how the existing techniques perform in this new scenario. In this work, we conduct the first exploration (i.e., an industrial case study) to investigate the performance of the existing techniques in this new scenario with substantial adaptations. We adapted grammar-based compiler testing to this scenario by synthesizing new test programs based on code snippets and using compilation crash as test oracle due to the lack of reference compilers for differential testing. We also adapted metamorphic-based compiler testing to this scenario by constructing equivalent test programs under any inputs to relieve the dependence on program analysis tools. We call the adapted techniques SynFuzz and MetaFuzz, respectively. We evaluated both SynFuzz and MetaFuzz on two versions of a new compiler for a new-born programming language in a global IT company. By comparing with the testing practice adopted by the testing team and the general fuzzer (AFL), SynFuzz can detect more bugs during the same testing time, and both SynFuzz and MetaFuzz can complement the other two techniques. In particular, SynFuzz and MetaFuzz have detected 11 previously unknown bugs, all of which have been fixed by the developers. From the industrial case study, we summarized a series of lessons and suggestions for practical use and future research.

History-driven Test Program Synthesis for JVM Testing

JITO: a Tool for Just-in-time Defect Identification and Localization

Deep Differential Testing of JVM Implementations

Detecting JVM JIT Compiler Bugs via Exploring Two-Dimensional Input Spaces

Effective code coverage in compositional systematic dynamic testing

Coverage-directed Differential Testing of JVM Implementations.

History-Guided Configuration Diversification for Compiler Test-Program Generation

Constructing Exception Handling Chains for Testing Java Virtual Machine Implementations

Java JIT Testing with Template Extraction

DiffGen: Automated Unit Test Generation for Regression Testing

An Empirical Study on Automated Test Generation Tools for Java: Effectiveness and Challenges

Directed Test Program Generation for JIT Compiler Bug Localization

Synthesizing Method Sequences for High-Coverage Testing

Pattern-Based Peephole Optimizations with Java JIT Tests

Isomorphic Regression Testing: Executing Uncovered Branches Without Test Augmentation.

DiffGen: Automated Regression Unit-Test Generation

Complete Shadow Symbolic Execution with Java PathFinder

Testing the Compiler for a New-Born Programming Language: An Industrial Case Study (Experience Paper)

Program Tailoring: Slicing by Sequential Criteria

Validating JIT Compilers via Compilation Space Exploration

An Integrated Regression Testing Framework to Multi-Threaded Java Programs