Heterogeneity-aware Proactive Elastic Resource Allocation for Serverless Applications
Binbin Feng,Zhijun Ding,Xiaobo Zhou,Changjun Jiang
DOI: https://doi.org/10.1109/tsc.2024.3350711
IF: 11.019
2024-01-01
IEEE Transactions on Services Computing
Abstract: Serverless computing is a popular cloud computing model that offers on-demand resource allocation and pay-as-you-go application execution. However, allocating resources for workflow applications still poses challenges: inaccurate and inefficient resource estimation, high-latency inter-function communication, and long server readiness time. We therefore propose the heterogeneity-aware Proactive serverLess wOrkflow Elastic Allocation method (PLOEA) to address these issues and optimize infrastructure costs for cloud service providers (CSPs) while meeting the diverse needs of developers. First, we propose a resource configuration estimation method for heterogeneous workflow applications that builds an ensemble multi-task expert classifier to analyze individual and common resource usage patterns, ensuring estimation accuracy and efficiency. Second, we propose a group allocation strategy for multiple applications that optimizes the spatiotemporal distribution of instances by considering allocation urgency, communication affinity between functions, and the multi-core architecture of servers. Third, we present a proactive server elastic scaling method that senses workload features, including workload level, trend, and magnitude changes, and combines them with the CSP's differing attention to each feature to determine the server scaling size. Finally, experiments on public datasets show that PLOEA provides better service quality and cost efficiency than existing methods.
computer science, information systems, software engineering
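Illustrative note: the abstract describes proactive scaling as combining workload level, trend, and magnitude change with the CSP's attention to each feature. The sketch below is one possible reading of that idea, not PLOEA's actual method. All names (WorkloadFeatures, extract_features, target_servers), the linear weighting, the least-squares trend, and the per-server capacity parameter are assumptions made for illustration only.

```python
from dataclasses import dataclass
from math import ceil
from typing import Sequence, Tuple


@dataclass
class WorkloadFeatures:
    level: float             # mean request rate over the window
    trend: float             # least-squares slope, in requests per step
    magnitude_change: float  # most recent step-to-step jump


def extract_features(window: Sequence[float]) -> WorkloadFeatures:
    """Compute the three features from a window of request rates (hypothetical helper)."""
    n = len(window)
    if n < 2:
        raise ValueError("need at least two samples")
    level = sum(window) / n
    x_mean = (n - 1) / 2
    # Ordinary least-squares slope of rate against time index.
    num = sum((i - x_mean) * (y - level) for i, y in enumerate(window))
    den = sum((i - x_mean) ** 2 for i in range(n))
    return WorkloadFeatures(level, num / den, window[-1] - window[-2])


def target_servers(
    features: WorkloadFeatures,
    weights: Tuple[float, float, float],
    capacity_per_server: float,
    lookahead_steps: int = 1,
) -> int:
    """Weighted combination of features -> server count (assumed linear form).

    `weights` stands in for the CSP's attention to level, trend, and
    magnitude change; the paper's actual combination rule may differ.
    """
    w_level, w_trend, w_mag = weights
    predicted = (
        w_level * features.level
        + w_trend * features.trend * lookahead_steps
        # Only upward jumps add headroom in this sketch.
        + w_mag * max(features.magnitude_change, 0.0)
    )
    return max(1, ceil(predicted / capacity_per_server))


if __name__ == "__main__":
    feats = extract_features([100, 120, 140, 160])
    print(feats)
    print(target_servers(feats, (1.0, 1.0, 0.5), capacity_per_server=50, lookahead_steps=2))
```

Raising the trend weight makes this sketch scale out earlier on a rising workload, which may help hide long server readiness times at the price of more idle capacity; a lower weight is more cost-conservative.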