Double-Q Learning-Based Dvfs For Multi-Core Real-Time Systems

Hui Huang,Man Lin,Qingchen Zhang
DOI: https://doi.org/10.1109/iThings-GreenCom-CPSCom-SmartData.2017.83
2017-01-01
Abstract:The Q-learning based DVFS selection algorithm has been used to lower energy consumption for system-level power management. However, this algorithm normally suffers from overestimation since it greedily uses maximum action value to approximate expected value, consequently fails to select the most appropriate DVFS method for real-time systems. In this article, we propose a Double-Q learning based DVFS selection algorithm to reduce energy consumption. In our scheme, instead of approximating actual action values with only one estimator, it implements Double-Q learning that applies two estimators to efficiently reduce overestimation, leading to an energy-aware scheme that can maintain a relatively stable and sufficient performance in action selection. We evaluate the performance of the proposed scheme through simulated data sets. Results demonstrate that our scheme can save more energy than the Q-learning based scheme while adapting to various system conditions and provide a more stable and accurate DVFS policy selection mechanism for multi-core real-time systems.
What problem does this paper attempt to address?