Abstract:Given the rise in demand for cloud computing in the modern era, the effectiveness of resource utilization is eminent to decrease energy footprint and achieve economic services. With the emerging machine learning and artificial intelligence techniques to model and predict, it is essential to explore a principal method that provides the best solution for the accurate provisioning of forthcoming requests in a cloud data center. Recent studies used machine learning and other advanced analytics to predict resource usage; however, these do not consider long-range dependencies in the time series, which is essential to capture for better prediction. Further, they show limitations in handling noise, missing values, and outliers in datasets. In this paper, we explored the problem by studying three techniques that enabled us to answer improvements in short-term forecasting of physical machines' resource usage if the above factors are considered. We evaluated the predictions using Transformer and Informer deep learning models that cover the above aspects and compared them with the Long short-term memory (LSTM) model. We used a real-world Google cluster trace usage dataset and employed Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH) algorithm to select heterogeneous machines. The evaluation of the three models depicts that the Transformer architecture that considers long-range dependencies in time series and shortcomings with datasets shows improvement in forecasting with 14.2% reduction in RMSE than LSTM. However, LSTM shows better results for some machines than the Transformer, which depicts the importance of input sequence order. The Informer model, which considers both dependencies and is a hybrid of LSTM and Transformer, outperformed both models with 21.7% from LSTM and 20.8% from Transformer reduction in RMSE. The results also depict Informer model consistently performs better than the other models across all subsets of the dataset. Our study proves that considering long-range dependencies and sequence ordering for resource usage time series improves the prediction.

A comparison of forecasting models for the resource usage of MapReduce applications

Power Load Prediction Model Based on Long Short Term Memory and Sparrow Search Algorithm

Reliable Estimation of Execution Time of MapReduce Program

Comparison and Improvement of Hadoop MapReduce Performance Prediction Models in the Private Cloud.

Comparative analysis of cloud resources forecasting using deep learning techniques based on VM workload traces

Conventional models and artificial intelligence-based models for energy consumption forecasting: A review

An efficient forecasting approach for resource utilization in cloud data center using CNN-LSTM model

Comparative study of long short-term memory (LSTM), bidirectional LSTM, and traditional machine learning approaches for energy consumption prediction

Long Short Term Memory Recurrent Neural Network (LSTM-RNN) Based Workload Forecasting Model For Cloud Datacenters

Application of Neural Network and Support Vector Machines to Power System Short-term Load Forecasting.

A Hybrid Grey System Model Based on Stacked Long Short-Term Memory Layers and Its Application in Energy Consumption Forecasting

Short‐term power load forecasting based on multi‐layer bidirectional recurrent neural network

Comparative Study of LSTM and ANN Models for Power Consumption Prediction of Variable Refrigerant Flow (VRF) Systems in Buildings

Deep Learning-Based Power Usage Forecast Modeling and Evaluation

An overview and comparative analysis of Recurrent Neural Networks for Short Term Load Forecasting

Short-term Load Forecasting Using a Long Short-Term Memory Network.

Regression modeling for enterprise electricity consumption: A comparison of recurrent neural network and its variants

Energy Consumption Forecasting in a University Office by Artificial Intelligence Techniques: An Analysis of the Exogenous Data Effect on the Modeling

Short-term power load forecasting based on sparrow search algorithm-variational mode decomposition and attention-long short-term memory

A Regressive Convolution Neural network and Support Vector Regression Model for Electricity Consumption Forecasting

An effective deep learning architecture leveraging BIRCH clustering for resource usage prediction of heterogeneous machines in cloud data center