Stochastic model predictive control for energy management of power-split plug-in hybrid electric vehicles based on reinforcement learning

Zheng Chen,Hengjie Hu,Yitao Wu,Yuanjian Zhang,Guang Li,Yonggang Liu
DOI: https://doi.org/10.1016/j.energy.2020.118931
IF: 9
2020-11-01
Energy
Abstract:In this paper, a stochastic model predictive control (MPC) method based on reinforcement learning is proposed for energy management of plug-in hybrid electric vehicles (PHEVs). Firstly, the power transfer of each component in a power-split PHEV is described in detail. Then an effective and convergent reinforcement learning controller is trained by the Q-learning algorithm according to the driving power distribution under multiple driving cycles. By constructing a multi-step Markov velocity prediction model, the reinforcement learning controller is embedded into the stochastic MPC controller to determine the optimal battery power in predicted time domain. Numerical simulation results verify that the proposed method achieves superior fuel economy that is close to that by stochastic dynamic programming method. In addition, the effective state of charge tracking in terms of different reference trajectories highlight that the proposed method is effective for online application requiring a fast calculation speed.
energy & fuels,thermodynamics
What problem does this paper attempt to address?
This paper aims to solve the energy management problem of plug - in hybrid electric vehicles (PHEVs), especially how to optimize energy distribution through the stochastic model predictive control (SMPC) method based on reinforcement learning to reduce fuel consumption and improve energy efficiency. Specifically, the researchers proposed an SMPC method combined with the reinforcement learning algorithm (Q - learning) to predict the optimal battery power under various driving cycle conditions, and verified the effectiveness of this method through numerical simulation. In addition, the state - of - charge (SOC) tracking effect of this method under different reference trajectories also shows that it is suitable for online applications requiring fast calculation speed. The main contribution of the paper lies in applying reinforcement learning to SMPC to realize online rolling control optimization, and establishing a reinforcement learning controller combined with speed prediction, which provides effective support for the machine - learning - based energy management strategy of PHEVs.