Abstract:Summary Continuous integration, at its core, includes a set of practices that aim to prevent and reduce the cost of software integration issues by merging working software copies often. Regression testing is considered a good practice in software development with continuous integration, which ensures that code changes are not negatively affecting software functionality. As, nowadays, software development is carried out iteratively, with small code increments continuously developed and regression tested, it is of critical importance that continuous regression testing is time efficient. However, in practice, regression testing is often long lasting and faces scalability problems as software grows larger or as software changes are made more frequently. One contributing factor to these issues is test redundancy, which causes the same software functionality being tested multiple times across a test suite. In large‐scale software, especially highly configurable software, redundancy in continuous regression testing can significantly grow the size of test suites and negatively affect the cost effectiveness of continuous integration. This paper presents a practical learning algorithm for optimizing continuous integration testing by reducing ineffective test redundancy in regression suites. The novelty of the algorithm lies in learning and predicting the fault‐detection effectiveness of continuous integration tests using historical test records and combining this information with coverage‐based redundancy metrics. The goal is to identify ineffective redundancy, which is maximally reduced in the resulting regression test suite, thus reducing test time and improving the performance of continuous integration. We apply and evaluate the algorithm in two industrial projects of continuous integration. The results show that the proposed algorithm can improve the efficiency of continuous integration practice in terms of decreasing test execution time by 38% on average compared to the industry practice of our case study and by 40% on average compared to the retest‐all approach. The results further demonstrate no significant reduction in fault‐detection effectiveness of continuous regression testing. This suggests that the proposed algorithm contributes to the state of the practice in the continuous integration development and testing of highly configurable systems.

The Why, When, What, and How about Predictive Continuous Integration: A Simulation-Based Investigation

Predictive Models in Software Engineering: Challenges and Opportunities

Study of the Predictive Mechanism with Big Data-Driven Lean Manufacturing and Six Sigma Methodology

A Procedure to Continuously Evaluate Predictive Performance of Just-In-Time Software Defect Prediction Models During Software Development

An Experimental Evaluation of Imbalanced Learning and Time-Series Validation in the Context of CI/CD Prediction

On combining commit grouping and build skip prediction to reduce redundant continuous integration activity

Could We Predict the Result of a Continuous Integration Build? An Empirical Study

Uncovering the Benefits and Challenges of Continuous Integration Practices

Built to Last or Built Too Fast? Evaluating Prediction Models for Build Times

On the validity of retrospective predictive performance evaluation procedures in just-in-time software defect prediction

A Systematic Survey of Just-in-Time Software Defect Prediction

Adaptive Reward Computation in Reinforcement Learning-Based Continuous Integration Testing

Does Socio-Technical Congruence Have an Effect on Continuous Integration Build Failures? An Empirical Study on 10 GitHub Projects

Security Development Lifecycle-Based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization

Deep Learning Based Continuous Integration and Continuous Delivery Software Defect Prediction with Effective Optimization Strategy

A learning algorithm for optimizing continuous integration development and testing practice

JITLine: A Simpler, Better, Faster, Finer-grained Just-In-Time Defect Prediction

Implications of the Use of Artificial Intelligence Predictive Models in Health Care Settings : A Simulation Study

Practitioners' Challenges and Perceptions of CI Build Failure Predictions at Atlassian

ConPredictor: Concurrency Defect Prediction in Real-World Applications

Deep Incremental Learning of Imbalanced Data for Just-In-Time Software Defect Prediction