Abstract: Bug triaging refers to the process of assigning a bug to the most appropriate developer to fix it. It becomes increasingly difficult and complicated as the size of the software and the number of developers grow. In this paper, we propose a new framework for bug triaging, which maps the words in bug reports (i.e., the term space) to their corresponding topics (i.e., the topic space). We propose a specialized topic modeling algorithm named multi-feature topic model (MTM), which extends Latent Dirichlet Allocation (LDA) for bug triaging. MTM considers the product and component information of bug reports when mapping the term space to the topic space. Finally, we propose an incremental learning method named TopicMiner, which uses the topic distribution of a new bug report to assign an appropriate fixer based on the affinity of the fixer to the topics. We pair TopicMiner with MTM (TopicMiner$^{MTM}$). We have evaluated our solution on 5 large bug report datasets (GCC, OpenOffice, Mozilla, Netbeans, and Eclipse) containing a total of 227,278 bug reports. We show that TopicMiner$^{MTM}$ achieves top-1 and top-5 prediction accuracies of 0.4831-0.6868 and 0.7686-0.9084, respectively. We also compare TopicMiner$^{MTM}$ with Bugzie, LDA-KL, SVM-LDA, LDA-Activity, and Yang et al.'s approach. The results show that, on average, TopicMiner$^{MTM}$ improves the top-1 and top-5 prediction accuracies of Bugzie by 128.48 and 53.22 percent, LDA-KL by 262.91 and 105.97 percent, SVM-LDA by 205.89 and 110.48 percent, LDA-Activity by 377.60 and 176.32 percent, and Yang et al.'s approach by 59.88 and 13.70 percent, respectively.
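
The idea of assigning a fixer by topic affinity, and of scoring the result with top-k accuracy, can be illustrated with a minimal sketch. It is not the TopicMiner or MTM formulation from the paper: the affinity score below is a plain dot product between a developer's normalized topic profile and the new report's topic distribution, and the names `update_affinity`, `rank_fixers`, and `top_k_accuracy`, as well as the toy data, are assumptions made for illustration only.

```python
from collections import defaultdict

def update_affinity(affinity, fixer, topic_dist):
    """Incrementally add a fixed bug's topic distribution to its fixer's profile."""
    for topic, weight in topic_dist.items():
        affinity[fixer][topic] += weight

def rank_fixers(affinity, topic_dist, k=5):
    """Return the k developers whose normalized topic profile best matches topic_dist.

    Assumption: affinity is a dot product; the paper's actual score may differ.
    """
    scores = {}
    for fixer, profile in affinity.items():
        total = sum(profile.values())
        if total == 0:
            continue  # skip developers with an empty profile
        scores[fixer] = sum(
            profile.get(topic, 0.0) / total * weight
            for topic, weight in topic_dist.items()
        )
    # Sort by descending score, breaking ties by name for determinism.
    return sorted(scores, key=lambda f: (-scores[f], f))[:k]

def top_k_accuracy(recommendations, actual_fixers):
    """Fraction of bugs whose actual fixer appears in the recommended list."""
    hits = sum(
        1 for recs, actual in zip(recommendations, actual_fixers) if actual in recs
    )
    return hits / len(actual_fixers)

# Hypothetical history: (fixer, topic distribution of the bug they fixed).
history = [
    ("alice", {0: 0.9, 1: 0.1}),
    ("bob", {0: 0.2, 1: 0.8}),
    ("alice", {0: 0.7, 1: 0.3}),
]

affinity = defaultdict(lambda: defaultdict(float))
for fixer, dist in history:
    update_affinity(affinity, fixer, dist)

# A new report dominated by topic 1 is routed to the developer closest to that topic.
new_report = {0: 0.1, 1: 0.9}
top1 = rank_fixers(affinity, new_report, k=1)
```

In this toy setting `top1` contains only `"bob"`, whose history is concentrated on topic 1. Because `update_affinity` only adds to an existing profile, newly fixed reports can be folded in one at a time without retraining, which is the sense in which such a scheme is incremental.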

TAM: Targeted Analysis Model With Reinforcement Learning on Short Texts

Improving Automated Bug Triaging with Specialized Topic Model.

Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

A biterm topic model for short texts

Utilizing Recurrent Neural Network for Topic Discovery in Short Text Scenarios

Topic Discovery for Streaming Short Texts with CTM.

A Joint Model Of Extended Lda And Ibtm Over Streaming Chinese Short Texts

Modeling over Short Texts

BTM: Topic Modeling over Short Texts

Using Topic Modeling Methods for Short-Text Data: A Comparative Analysis

Context Reinforced Neural Topic Modeling over Short Texts

DATM: A Novel Data Agnostic Topic Modeling Technique With Improved Effectiveness for Both Short and Long Text

Improved BTM topic embedding method for Web text data extraction

Tipster: A Topic-Guided Language Model for Topic-Aware Text Segmentation.

Fast Supervised Topic Models for Short Text Emotion Detection

TSSE-DMM: Topic Modeling for Short Texts Based on Topic Subdivision and Semantic Enhancement

A topic-enhanced dirichlet model for short text stream clustering

A Self-adaptive Sliding Window Based Topic Model for Non-uniform Texts

Topic model based on co-occurrence word networks for unbalanced short text datasets

TDAM: a Topic-Dependent Attention Model for Sentiment Analysis

Short Text Topic Modeling With Flexible Word Patterns