A deep multimodal model for bug localization

Ziye Zhu,Yun Li,Yu Wang,Yaojing Wang,Hanghang Tong
DOI: https://doi.org/10.1007/s10618-021-00755-7
IF: 5.406
2021-04-28
Data Mining and Knowledge Discovery
Abstract:Bug localization utilizes the collected bug reports to locate the buggy source files. The state of the art falls short in handling the following three aspects, including (L1) the subtle difference between natural language and programming language, (L2) the noise in the bug reports and (L3) the multi-grained nature of programming language. To overcome these limitations, we propose a novel deep multimodal model named <span class="u-small-caps">DeMoB</span> for bug localization. It embraces three key features, each of which is tailored to address each of the three limitations. To be specific, the proposed <span class="u-small-caps">DeMoB</span> generates the multimodal coordinated representations for both bug reports and source files for addressing L1. It further incorporates the AttL encoder to process bug reports for addressing L2, and the MDCL encoder to process source files for addressing L3. Extensive experiments on four large-scale real-world data sets demonstrate that the proposed <span class="u-small-caps">DeMoB</span> significantly outperforms existing techniques.
computer science, information systems, artificial intelligence
What problem does this paper attempt to address?