LLM-BRC: A large language model-based bug report classification framework

Xiaoting Du,Zhihao Liu,Chenglong Li,Xiangyue Ma,Yingzhuo Li,Xinyu Wang
DOI: https://doi.org/10.1007/s11219-024-09675-3
2024-05-25
Software Quality Journal
Abstract:Deep learning frameworks serve as the cornerstone for constructing robust deep learning systems. However, bugs within these frameworks can have severe consequences, negatively affecting various applications. Accurately classifying and understanding these bugs is essential to ensure framework reliability. By doing so, developers can proactively take appropriate measures to mitigate potential risks associated with specific bug types in both current and future software releases. Despite the significance of bug report classification, existing methods fall short in terms of performance, rendering them impractical for real-world applications. To address this limitation, we propose a bug report classification framework for deep learning frameworks, called LLM–BRC, leveraging OpenAI's latest embedding model, text-embedding-ada-002 . Our LLM–BRC framework achieves an impressive accuracy range of 92% to 98.75% in bug report classification for three deep learning frameworks: TensorFlow, MXNET, and PaddlePaddle. This represents a substantial improvement of 17.21% to 69.15% compared to existing methods. Furthermore, we conduct a comprehensive investigation into the impact of different bug report components and different models.
computer science, software engineering
What problem does this paper attempt to address?