TrickyBugs: A Dataset of Corner-case Bugs in Plausible Programs

Kaibo Liu, Yudong Han, Yiyang Liu, Jie M. Zhang, Zhenpeng Chen, Federica Sarro, Gang Huang, Yun Ma
DOI: https://doi.org/10.1145/3643991.3644870
2024-01-01
Abstract: We call a program that passes existing tests but still contains bugs a buggy plausible program. Bugs in such a program can slip past the testing environment and reach the production environment, causing unpredictable consequences. Therefore, discovering and fixing such bugs is a fundamental and critical problem. However, no existing bug dataset is designed to collect this kind of bug, which poses a significant obstacle to related research. To address this gap, we introduce TrickyBugs, a bug dataset with 3,043 buggy plausible programs sourced from human-written submissions to 324 real-world competition coding tasks. We identified the buggy plausible programs from approximately 400,000 submissions, and none of the bugs in TrickyBugs had been detected previously. We hope that TrickyBugs can effectively facilitate research in automated program repair, fault localization, test generation, and test adequacy.
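To make the definition concrete, the following is a minimal illustrative sketch of a buggy plausible program. It is not taken from the dataset and does not reproduce the authors' detection method; the task, the function names (`reference_max`, `plausible_max`, `passes`, `find_counterexample`), and all inputs are hypothetical. It assumes a trusted reference solution is available for comparison.

```python
from typing import Callable, List, Optional, Tuple

Program = Callable[[List[int]], int]


def reference_max(xs: List[int]) -> int:
    """Trusted solution for a hypothetical task: largest element of a non-empty list."""
    return max(xs)


def plausible_max(xs: List[int]) -> int:
    """Hypothetical buggy submission: starting from 0 is wrong for all-negative input."""
    best = 0
    for x in xs:
        if x > best:
            best = x
    return best


def passes(program: Program, tests: List[Tuple[List[int], int]]) -> bool:
    """Return True if the program produces the expected output on every test."""
    return all(program(inp) == expected for inp, expected in tests)


def find_counterexample(
    program: Program, reference: Program, inputs: List[List[int]]
) -> Optional[List[int]]:
    """Return the first input on which the program disagrees with the reference."""
    for inp in inputs:
        if program(inp) != reference(inp):
            return inp
    return None


# The existing tests (assumed) never exercise negative numbers.
existing_tests = [([1, 2, 3], 3), ([5], 5), ([0, 7, 2], 7)]
# Extra corner-case inputs (assumed) that the existing tests miss.
corner_inputs = [[0], [-1], [-3, -2]]

is_plausible = passes(plausible_max, existing_tests)
counterexample = find_counterexample(plausible_max, reference_max, corner_inputs)
```

Here `plausible_max` passes every existing test, so it is plausible, yet it disagrees with the reference on an all-negative list, which is the kind of corner case the abstract describes.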