Data-efficient Performance Learning for Configurable Systems

Jianmei Guo, Dingyu Yang, Norbert Siegmund, Sven Apel, Atrisha Sarkar, Pavel Valov, Krzysztof Czarnecki, Andrzej Wasowski, Huiqun Yu
DOI: https://doi.org/10.1007/s10664-017-9573-6
IF: 3.762
2017-01-01
Empirical Software Engineering
Abstract: Many software systems today are configurable, offering customization of functionality by feature selection. Understanding how performance varies in terms of feature selection is key for selecting appropriate configurations that meet a set of given requirements. Due to a huge configuration space and the possibly high cost of performance measurement, it is usually not feasible to explore the entire configuration space of a configurable system exhaustively. It is thus a major challenge to accurately predict performance based on a small sample of measured system variants. To address this challenge, we propose a data-efficient learning approach, called DECART, that combines several techniques of machine learning and statistics for performance prediction of configurable systems. DECART builds, validates, and determines a prediction model based on an available sample of measured system variants. Empirical results on 10 real-world configurable systems demonstrate the effectiveness and practicality of DECART. In particular, DECART achieves a prediction accuracy of 90% or higher based on a small sample, whose size is linear in the number of features. In addition, we propose a sample quality metric and introduce a quantitative analysis of the quality of a sample for performance prediction.