Abstract:Software defects are well-known in software development and might cause several problems for users and developers aside. As a result, researches employed distinct techniques to mitigate the impacts of these defects in the source code. One of the most notable techniques focuses on defect prediction using machine learning methods, which could support developers in handling these defects before they are introduced in the production environment. These studies provide alternative approaches to predict the likelihood of defects. However, most of these works concentrate on predicting defects from a vast set of software features. Another key issue with the current literature is the lack of a satisfactory explanation of the reasons that drive the software to a defective state. Specifically, we use a tree boosting algorithm (XGBoost) that receives as input a training set comprising records of easy-to-compute characteristics of each module and outputs whether the corresponding module is defect-prone. To exploit the link between predictive power and model explainability, we propose a simple model sampling approach that finds accurate models with the minimum set of features. Our principal idea is that features not contributing to increasing the predictive power should not be included in the model. Interestingly, the reduced set of features helps to increase model explainability, which is important to provide information to developers on features related to each module of the code which is more defect-prone. We evaluate our models on diverse projects within Jureczko datasets, and we show that (i) features that contribute most for finding best models may vary depending on the project and (ii) it is possible to find effective models that use few features leading to better understandability. We believe our results are useful to developers as we provide the specific software features that influence the defectiveness of selected projects.

Analysis of the Effectiveness of Large Language Model Feature in Source Code Defect Detection

An Evalutation of Programming Language Models' performance on Software Defect Detection

Impact of Large Language Models of Code on Fault Localization

Predicting Defective Visual Code Changes in a Multi-Language AAA Video Game Project

Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study

Deep Learning-Based Software Defect Prediction via Semantic Key Features of Source Code—Systematic Survey

An Approach to Semantic and Structural Features Learning for Software Defect Prediction

Understanding machine learning software defect predictions

Deep Learning for Just-In-Time Defect Prediction

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

DLAP: A Deep Learning Augmented Large Language Model Prompting Framework for Software Vulnerability Detection

Towards One Reusable Model for Various Software Defect Mining Tasks

Learning Semantic Features for Software Defect Prediction by Code Comments Embedding

Representation vs. Model: What Matters Most for Source Code Vulnerability Detection

VulDetectBench: Evaluating the Deep Capability of Vulnerability Detection with Large Language Models

Understanding the Effectiveness of Large Language Models in Detecting Security Vulnerabilities

Machine Learning-Powered Identification of Source Code Vulnerabilities

Large Language Models for Secure Code Assessment: A Multi-Language Empirical Study

Leveraging Large Language Models for Efficient Failure Analysis in Game Development

Outside the Comfort Zone: Analysing LLM Capabilities in Software Vulnerability Detection