Abstract: Bug assignment, or bug triage, focuses on identifying the appropriate developers to repair newly discovered bugs, thereby managing them more effectively. Several deep learning-based approaches have been proposed for automated bug assignment. These approaches treat automated bug assignment as a text classification task: the textual description of a bug report is the input, and the potential fixers are the output labels. Their effectiveness therefore depends on two choices: the deep learning model used for classification and the word embedding model used for representing bug reports. New word embedding and deep learning models continue to emerge, yet prior research has not empirically evaluated the impact of these choices on automated bug assignment. In this paper, we conduct an empirical study to analyze the performance variations among 35 deep learning-based automated bug assignment approaches. These approaches combine five word embedding techniques, i.e., Word2Vec, GloVe, NextBug, ELMo, and BERT, with seven text classification models, i.e., TextCNN, LSTM, Bi-LSTM, LSTM with attention, Bi-LSTM with attention, MLP, and Naive Bayes. We evaluated these combinations on three benchmark datasets, namely Eclipse JDT, GCC, and Firefox, and on their combination, i.e., a cross-project dataset.
Our main observations are: (1) Bi-LSTM with attention and Bi-LSTM using ELMo are significantly superior to other deep learning models on bug assignment tasks in terms of top-k (k = 1, 5, 10) accuracy and mean reciprocal rank (MRR); (2) both the summary and the description of bug reports are useful for bug assignment, but the description is more useful than the summary; (3) the training corpus for word embedding models has a significant impact on the performance of deep learning-based bug assignment methods. Our results show the importance of tuning the different components (e.g., word embedding model, classification model, and textual input) of deep learning-based automated bug assignment methods and provide important insights for practitioners and researchers.
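
The two evaluation metrics named above can be illustrated with a minimal sketch. It assumes the standard definitions of top-k accuracy and MRR; the abstract does not spell out the exact formulation used in the study, and the function names and developer names below are hypothetical.

```python
from typing import Sequence


def top_k_accuracy(ranked: Sequence[Sequence[str]],
                   actual: Sequence[str], k: int) -> float:
    """Fraction of bug reports whose actual fixer is among the top-k predictions."""
    hits = sum(1 for preds, fixer in zip(ranked, actual) if fixer in preds[:k])
    return hits / len(actual)


def mean_reciprocal_rank(ranked: Sequence[Sequence[str]],
                         actual: Sequence[str]) -> float:
    """Mean of 1/rank of the actual fixer; a fixer missing from the list contributes 0."""
    total = 0.0
    for preds, fixer in zip(ranked, actual):
        if fixer in preds:
            total += 1.0 / (list(preds).index(fixer) + 1)
    return total / len(actual)


# Hypothetical ranked developer predictions for three bug reports.
ranked_predictions = [
    ["alice", "bob", "carol"],   # actual fixer ranked 1st
    ["bob", "carol", "alice"],   # actual fixer ranked 3rd
    ["carol", "alice", "bob"],   # actual fixer not in the list
]
actual_fixers = ["alice", "alice", "dave"]

top1 = top_k_accuracy(ranked_predictions, actual_fixers, k=1)
top3 = top_k_accuracy(ranked_predictions, actual_fixers, k=3)
mrr = mean_reciprocal_rank(ranked_predictions, actual_fixers)
print(top1, top3, mrr)
```

In this toy example, one of three reports is correct at rank 1 and two of three within the top 3, while MRR also rewards the second report for placing the actual fixer at rank 3.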