A Study of Bug Resolution Characteristics in Popular Programming Languages

Jie M. Zhang,Feng Li,Dan Hao,Meng Wang,Hao Tang,Lu Zhang,Mark Harman
DOI: https://doi.org/10.1109/tse.2019.2961897
IF: 7.4
2021-01-01
IEEE Transactions on Software Engineering
Abstract:This paper presents a large-scale study that investigates the bug resolutioncharacteristics among popular Github projects written in different programminglanguages. We explore correlations but, of course, we cannot infer causation.Specifically, we analyse bug resolution data from approximately 70 millionSource Line of Code, drawn from 3 million commits to 600 GitHub projects,primarily written in 10 programming languages. We find notable variations inapparent bug resolution time and patch (fix) size. While interpretation ofresults from such large-scale empirical studies is inherently difficult, webelieve that the differences in medians are sufficiently large to warrantfurther investigation, replication, re-analysis and follow up research. Forexample, in our corpus, the median apparent bug resolution time (elapsed timefrom raise to resolve) for Ruby was 4X that for Go and 2.5X for Java. We alsofound that patches tend to touch more files for the corpus of strongly typedand for statically typed programs. However, we also found evidence for a lowerelapsed resolution time for bug resolution committed to projects constructedfrom statically typed languages. These findings, if replicated in subsequentfollow on studies, may shed further empirical light on the debate about theimportance of static typing.
What problem does this paper attempt to address?