An Empirical Study on Bugs in Python Interpreters
Ziyuan Wang,Dexin Bu,Aiyue Sun,Shanyi Gou,Yong Wang,Lin Chen
DOI: https://doi.org/10.1109/tr.2022.3159812
IF: 5.883
2022-01-01
IEEE Transactions on Reliability
Abstract:Python is an interpreted programming language that has been widely used in many fields. The successful execution of a Python program depends on both the correctness of Python program and the correctness of Python interpreter. As an infrastructure software, there are many bugs in the Python interpreter. Exploring the bugs in Python interpreters can help developers and maintainers of Python interpreters detect and fix bugs and help users of Python avoid risks. In this article, we conduct an empirical study on the bugs in two mainstream Python interpreters: CPython and PyPy. By analyzing 25 958 fixed bugs, 18 824 revisions, 2 116 test cases, and root causes of randomly sampled 510 bugs, we have summarized the following findings. 1)The distribution of bugs in the Python interpreter is so uneven that the vast majority of bugs are distributed in a few components and source files. 2)The scales of the testing programs that reveal bugs are small. 3)The fixing works seem to be not complicated since the number of modified source files and lines of code are limited; however, most bugs need a long time to be fixed; nearly 15% of the bugs need more than one year to fix. 4)The priorities of bugs are independent of their locations, but they significantly correlate with duration of bugs. 5)Semantic bugs are the most frequent root causes of bugs, and their proportion exceeds other types of root causes. These results could indicate some potential problems during the detecting and fixing of Python interpreter's bugs, and provide some assistance to developers and maintainers of Python interpreters, users of Python, as well as researchers in related fields.
engineering, electrical & electronic,computer science, software engineering, hardware & architecture