Large-Scale Data Center Cooling Control Via Sample-Efficient Reinforcement Learning

Ni Mu, Xiao Hu, Qing-Shan Jia, Xu Zhu, Xiao He
DOI: https://doi.org/10.1109/case59546.2024.10711622
2024-01-01
Abstract: Cooling control in large-scale data centers (DCs) aims to minimize energy consumption while maintaining suitable temperatures for IT equipment. In the multi-ACU joint control problem, the high dimensionality of the state and action spaces, together with time-consuming simulation, makes traditional control methods inefficient at obtaining the optimal policy. We propose a novel reinforcement learning (RL) framework that first leverages a surrogate model to train a base RL policy and then fine-tunes the base policy. Specifically, the surrogate model can exponentially reduce the action space dimension by treating all ACUs in the room as identical and interpolating the state space based on physical mechanisms. After the base policy for single-ACU control is derived from the surrogate model, we fine-tune it on each individual ACU using the Soft Actor-Critic (SAC) algorithm with balanced replay. The balanced replay technique mitigates the off-policy bootstrapping errors caused by inaccurate value estimation while enabling efficient Q-value updating. Simulation results demonstrate that our framework significantly improves energy savings and temperature stability for large-scale DC cooling control while requiring far fewer data samples than existing state-of-the-art methods.
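One way to read the "exponential" reduction claimed above is in terms of how many joint actions a controller must consider. The sketch below is illustrative arithmetic only, not the authors' implementation: it assumes each ACU's control input is discretized into a fixed number of levels (a hypothetical simplification, since SAC normally operates on continuous actions), and the function names are invented for this example.

```python
def joint_action_count(num_acus: int, levels_per_acu: int) -> int:
    """Joint actions when every ACU is set independently.

    Assumption (not from the paper): each ACU has `levels_per_acu`
    discrete settings, so the joint space has levels ** num_acus actions.
    """
    return levels_per_acu ** num_acus


def shared_action_count(levels_per_acu: int) -> int:
    """Actions when all ACUs are treated as identical.

    A single shared setting is chosen for the whole room, so the count
    no longer depends on the number of ACUs.
    """
    return levels_per_acu


if __name__ == "__main__":
    levels = 5  # hypothetical number of discrete settings per ACU
    for n in (1, 4, 16):
        print(n, joint_action_count(n, levels), shared_action_count(levels))
```

Under these assumptions the joint count grows as a power of the number of ACUs, while the shared count stays constant, which is why a base policy trained for a single representative ACU is much cheaper to learn before per-ACU fine-tuning.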