Power Management for a Plug-in Hybrid Electric Vehicle Based on Reinforcement Learning with Continuous State and Action Spaces

Yuecheng Li,Hongwen He,Jiankun Peng,Hailong Zhang
DOI: https://doi.org/10.1016/j.egypro.2017.12.629
2017-01-01
Energy Procedia
Abstract:This paper presents a power management strategy for a plug-in hybrid electric vehicle based on reinforcement learning with continuous state and action spaces (Actor-Critic method, which has been highly successful in artificial intelligence field). Compared with discrete optimal methods, such as dynamic programming (DP) and Q-learning, the continuous method owns great potential in complex environments (much more sate variables) without worrying curse of dimensionality. A vehicle model is constructed for application of optimal algorithms, and power management problem is reformulated in accordance with Actor-Critic method. In order to guarantee the training process of proposed method to be quick and stable, stochastic gradient descent and experience replay is adopted. Both AC based method and DP based method are simulated on the same driving cycle. For one driving cycle, the total cost of a trained AC based method is only 2.76% higher than that of DP, while saving 88.7% of calculation time than that DP takes.
What problem does this paper attempt to address?