FS-DBoost: cross-server energy efficiency and performance prediction in cloud based on transfer regression

Guokai Wu,Huabin Wang,Weiwei Lin,Ruichao Mo,Xiaoxuan Luo
DOI: https://doi.org/10.1007/s10586-024-04370-1
2024-03-31
Cluster Computing
Abstract:The existing machine learning-based energy efficiency and performance models for the servers have achieved high prediction accuracy with the support of abundant training data. However, the differences in data distribution among heterogeneous servers in cloud data centers make it impractical to apply a machine learning-based model trained on one server to another directly. Meanwhile, collecting sufficient data for new servers from scratch incurs expensive time costs. To address these challenges, a transfer regression-based method for cross-server energy efficiency and performance prediction, named FS-DBoost, is proposed to reduce the overhead of establishing the two prediction models by transferring feature importance and distribution knowledge from data of existing servers. Initially, to transfer feature importance information, a sparse regression model is introduced to learn a feature selection vector from the source domain. Then, we design a boosting-based transfer algorithm, DBoost, to train the models for the target server using data with selected features. DBoost adjusts the total weight of the target domain based on the relatedness between source and target domains. The experimental results demonstrate that FS-DBoost achieves an average performance improvement of over 10% on real-world datasets compared to six transfer regression methods.
computer science, information systems, theory & methods
What problem does this paper attempt to address?