Abstract: Automated program repair (APR) aims to fix software bugs automatically and plays a crucial role in software development and maintenance. With the recent advances in deep learning (DL), an increasing number of APR techniques have been proposed that leverage neural networks to learn bug-fixing patterns from massive open-source code repositories. Such learning-based techniques usually treat APR as a neural machine translation (NMT) task, in which buggy code snippets (i.e., the source language) are translated into fixed code snippets (i.e., the target language) automatically. Benefiting from the powerful capability of DL to learn hidden relationships from previous bug-fixing datasets, learning-based APR techniques have achieved remarkable performance. In this paper, we provide a systematic survey to summarize the current state-of-the-art research in the learning-based APR community. We illustrate the general workflow of learning-based APR techniques and detail its crucial components, including the fault localization, patch generation, patch ranking, patch validation, and patch correctness phases. We then discuss the widely adopted datasets and evaluation metrics and outline existing empirical studies. We discuss several critical aspects of learning-based APR techniques, such as repair domains, industrial deployment, and the open science issue. We highlight several practical guidelines on applying DL techniques in future APR studies, such as exploring explainable patch generation and utilizing code features. Overall, our paper can help researchers gain a comprehensive understanding of the achievements of existing learning-based APR techniques and promote the practical application of these techniques. Our artifacts are publicly available at https://github.com/QuanjunZhang/AwesomeLearningAPR.
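The workflow named in the abstract (fault localization, patch generation, patch ranking, patch validation) can be pictured with a minimal Python sketch. This is only an illustration under stated assumptions, not the pipeline of any surveyed tool: the names `Patch`, `repair`, `localize`, `generate`, and `passes_tests` are hypothetical, and the hand-written stubs stand in for a real fault localizer and a trained NMT model.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Patch:
    code: str     # replacement text for one source line
    score: float  # hypothetical model confidence; higher is better


def repair(
    buggy: List[str],
    localize: Callable[[List[str]], List[int]],
    generate: Callable[[List[str], int], List[Patch]],
    passes_tests: Callable[[List[str]], bool],
    beam: int = 5,
) -> Optional[List[str]]:
    """Toy APR loop: localize, generate, rank, then validate."""
    for line_no in localize(buggy):                 # fault localization
        candidates = generate(buggy, line_no)       # patch generation
        ranked = sorted(candidates, key=lambda p: p.score, reverse=True)[:beam]  # patch ranking
        for patch in ranked:                        # patch validation
            fixed = buggy[:line_no] + [patch.code] + buggy[line_no + 1:]
            if passes_tests(fixed):
                # Passing the tests makes the patch plausible only; whether it
                # is actually correct is the separate patch correctness phase.
                return fixed
    return None


# Stubs (assumptions for illustration): a real system would use a fault
# localizer and a neural model here instead of fixed answers.
buggy = ["def add(a, b):", "    return a - b"]


def localize(lines: List[str]) -> List[int]:
    return [1]  # index of the suspicious line


def generate(lines: List[str], line_no: int) -> List[Patch]:
    return [Patch("    return a * b", 0.6), Patch("    return a + b", 0.3)]


def passes_tests(lines: List[str]) -> bool:
    namespace = {}
    exec("\n".join(lines), namespace)  # run the candidate program
    return namespace["add"](2, 3) == 5 and namespace["add"](0, 0) == 0


fixed = repair(buggy, localize, generate, passes_tests)
```

In this toy run the higher-scored candidate fails the tests and the lower-scored one passes, which shows why ranking alone is not enough and a validation step follows it.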

Benchmarking and Categorizing the Performance of Neural Program Repair Systems for Java

StandUp4NPR: Standardizing SetUp for Empirically Comparing Neural Program Repair Systems

Neural Program Repair: Systems, Challenges and Solutions

The Future Can’t Help Fix the Past: Assessing Program Repair in the Wild

RobustNPR: Evaluating the Robustness of Neural Program Repair Models

A critical review on the evaluation of automated program repair systems

Towards Reliable Evaluation of Neural Program Repair with Natural Robustness Testing

Syntax Guided Neural Program Repair

Neural Program Repair with Program Dependence Analysis and Effective Filter Mechanism

Benchmarking Educational Program Repair

RepairBench: Leaderboard of Frontier Models for Program Repair

A Survey of Learning-based Automated Program Repair

Tea: Program Repair Using Neural Network Based on Program Information Attention Matrix

Repairing Deep Neural Networks: Fix Patterns and Challenges

Automated Program Repair: Emerging trends pose and expose problems for benchmarks

How Effective Are Neural Networks for Fixing Security Vulnerabilities

You Cannot Fix What You Cannot Find! An Investigation of Fault Localization Bias in Benchmarking Automated Program Repair Systems

Practical Program Repair via Preference-based Ensemble Strategy

RunBugRun -- An Executable Dataset for Automated Program Repair

Where to Look When Repairing Code? Comparing the Attention of Neural Models and Developers

RePair: Automated Program Repair with Process-based Feedback