Abstract: Bug assignment, or bug triage, focuses on identifying the appropriate developers to repair newly discovered bugs, thereby managing them more effectively. Several deep learning-based approaches have been proposed for automated bug assignment. These approaches treat automated bug assignment as a text classification task: the textual description of a bug report is the input, and the potential fixers are the output labels. Their effectiveness therefore depends on two components: the word embedding model used to represent bug reports and the deep learning model used for classification. New word embedding and deep learning models continue to emerge, yet prior research has not empirically evaluated how the choice of these models affects automated bug assignment. In this paper, we conduct an empirical study to analyze the performance variations among 35 deep learning-based automated bug assignment approaches. These approaches combine five word embedding techniques, i.e., Word2Vec, GloVe, NextBug, ELMo, and BERT, with seven text classification models, i.e., TextCNN, LSTM, Bi-LSTM, LSTM with attention, Bi-LSTM with attention, MLP, and Naive Bayes. We evaluated these combinations on three benchmark datasets, namely Eclipse JDT, GCC, and Firefox, and on their merger, i.e., a cross-project dataset.
Our main observations are: (1) Bi-LSTM with attention and Bi-LSTM using ELMo are significantly superior to other deep learning models on bug assignment tasks in terms of top-k (k = 1, 5, 10) accuracy and MRR; (2) both the summary and the description of bug reports are useful for bug assignment, but the description is more useful than the summary; (3) the training corpus for word embedding models has a significant impact on the performance of deep learning-based bug assignment methods. Our results show the importance of tuning the different components (e.g., word embedding model, classification model, and textual input) of deep learning-based automated bug assignment methods and provide important insights for practitioners and researchers.
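For readers unfamiliar with the two metrics named above, the following is a minimal sketch of how top-k accuracy and MRR (mean reciprocal rank) are commonly computed for a ranked list of candidate fixers. The function names, the developer names, and the toy rankings are hypothetical and serve only as illustration; the abstract does not state the study's exact evaluation protocol, so this follows the standard textbook definitions.

```python
from typing import List


def top_k_accuracy(ranked: List[List[str]], truth: List[str], k: int) -> float:
    """Fraction of bug reports whose actual fixer is among the top-k candidates."""
    hits = sum(1 for cands, dev in zip(ranked, truth) if dev in cands[:k])
    return hits / len(truth)


def mean_reciprocal_rank(ranked: List[List[str]], truth: List[str]) -> float:
    """Average of 1/rank of the actual fixer; a missing fixer contributes 0."""
    total = 0.0
    for cands, dev in zip(ranked, truth):
        if dev in cands:
            total += 1.0 / (cands.index(dev) + 1)  # ranks are 1-based
    return total / len(truth)


# Hypothetical example: one ranked candidate list per bug report,
# ordered from most to least likely fixer according to some classifier.
ranked = [
    ["alice", "bob", "carol"],
    ["bob", "carol", "alice"],
    ["carol", "alice", "bob"],
]
# Hypothetical ground truth: the developer who actually fixed each bug.
truth = ["alice", "alice", "dave"]

top1 = top_k_accuracy(ranked, truth, k=1)
top3 = top_k_accuracy(ranked, truth, k=3)
mrr = mean_reciprocal_rank(ranked, truth)
```

In this toy data the first fixer is ranked first, the second is ranked third, and the third does not appear at all, so top-1 accuracy is 1/3, top-3 accuracy is 2/3, and MRR is (1 + 1/3 + 0) / 3 = 4/9.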
