High Concurrency Response Strategy based on Kubernetes Horizontal Pod Autoscaler

Qizheng Huo,Chengyang Li,Shaonan Li,Yongqiang Xie,Zhongbo Li
DOI: https://doi.org/10.1088/1742-6596/2451/1/012001
2023-03-09
Journal of Physics: Conference Series
Abstract:The automatic scaling of resources in the cluster can reduce costs while satisfying resource usage. This is one of the salient features of cloud computing. The current service scenarios are diverse, and the default policy of Horizontal Pod Autoscaling (HPA) is difficult to meet the demand. In the face of high concurrent traffic, the default strategy has the problem of long expansion time and slow speed. Based on HPA, this paper proposes the High Concurrency Response Strategy. It can cope with high concurrency usage scenarios and ensure that services can still be accessed in the face of large traffic concurrency. Compared with the default HPA, the customized HPA strategy reduces the number of service requests by over 20 ms or 19%.
What problem does this paper attempt to address?