Abstract:Context: Defect prediction is a very meaningful topic, particularly at change-level. Change-level defect prediction, which is also referred as just-in-time defect prediction, could not only ensure software quality in the development process, but also make the developers check and fix the defects in time [1].Objective: Ensemble learning becomes a hot topic in recent years. There have been several studies about applying ensemble learning to defect prediction [2-5]. Traditional ensemble learning approaches only have one layer, i.e., they use ensemble learning once. There are few studies that leverages ensemble learning twice or more. To bridge this research gap, we try to hybridize various ensemble learning methods to see if it will improve the performance of just-in-time defect prediction. In particular, we focus on one way to do this by hybridizing bagging and stacking together and leave other possibly hybridization strategies for future work.Method: In this paper, we propose a two-layer ensemble learning approach TLEL which leverages decision tree and ensemble learning to improve the performance of just-in-time defect prediction. In the inner layer, we combine decision tree and bagging to build a Random Forest model. In the outer layer, we use random under-sampling to train many different Random Forest models and use stacking to ensemble them once more.Results: To evaluate the performance of TLEL, we use two metrics, i.e., cost effectiveness and F1-score. We perform experiments on the datasets from six large open source projects, i.e., Bugzilla, Columba, JDT, Platform, Mozilla, and PostgreSQL, containing a total of 137,417 changes. Also, we compare our approach with three baselines, i.e., Deeper, the approach proposed by us [6], DNC, the approach proposed by Wang et al. [2], and MKEL, the approach proposed by Wang et al. [3]. The experimental results show that on average across the six datasets, TLEL could discover over 70% of the bugs by reviewing only 20% of the lines of code, as compared with about 50% for the baselines. In addition, the F1-scores TLEL can achieve are substantially and statistically significantly higher than those of three baselines across the six datasets.Conclusion: TLEL can achieve a substantial and statistically significant improvement over the state-of-the-art methods, i.e., Deeper, DNC and MKEL. Moreover, TLEL could discover over 70% of the bugs by reviewing only 20% of the lines of code. (C) 2017 Elsevier B.V. All rights reserved.

Robust Learning of Deep Predictive Models from Noisy and Imbalanced Software Engineering Datasets.

Unifying Defect Prediction, Categorization, and Repair by Multi-Task Deep Learning

Adversarial Learning from Imbalanced Data: A Robust Industrial Fault Classification Method

An Improved Semi-Supervised Learning Method for Software Defect Prediction.

Mitigating the impact of mislabeled data on deep predictive models: an empirical study of learning with noise approaches in software engineering tasks

Deep Learning for Just-In-Time Defect Prediction

Studying the effectiveness of deep active learning in software defect prediction

TLEL: A Two-Layer Ensemble Learning Approach for Just-in-time Defect Prediction

Robust Learning Against Label Noise Based on Activation Trend Tracking

A Survey of Different Approaches for the Class Imbalance Problem in Software Defect Prediction

Software visualization and deep transfer learning for effective software defect prediction

Software Defect Prediction Using Deep Q‐Learning Network‐Based Feature Extraction

A Novel Class-Imbalance Learning Approach for Both Within-Project and Cross-Project Defect Prediction.

Adaptive Centre-Weighted Oversampling for Class Imbalance in Software Defect Prediction

Software Defect Prediction Approach Based on a Diversity Ensemble Combined With Neural Network

Deep Semantic Feature Learning for Software Defect Prediction

Building Manufacturing Deep Learning Models with Minimal and Imbalanced Training Data Using Domain Adaptation and Data Augmentation

Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining

Deep Incremental Learning of Imbalanced Data for Just-In-Time Software Defect Prediction

Comparative Study of Ensemble Learning Methods in Just-in-time Software Defect Prediction

Deep Learning Software Defect Prediction Methods for Cloud Environments Research