Reinforcement Learning-Based Energy Management for Fuel Cell Electrical Vehicles Considering Fuel Cell Degradation

Qilin Shuai,Yiheng Wang,Zhengxiong Jiang,Qingsong Hua
DOI: https://doi.org/10.3390/en17071586
IF: 3.2
2024-03-27
Energies
Abstract:The service life and fuel consumption of fuel cell system (FCS) are the main factors limiting the commercialization of fuel cell electric vehicles (FCEV). Effective energy management strategies (EMS) can reduce fuel consumption during the cycle and prolong the service life of FCS. This paper proposes an energy management strategy based on the deep reinforcement learning (DRL) algorithm, deep Q-learning (DQL). Considering the unstable performance of conventional DQL during the training process, a new algorithm called Double Deep Q Learning (DDQL) is introduced. The DDQL uses a target evaluation network to evaluate output actions and a delayed update strategy to improve the convergence and stability of DRL. This article trains the strategy using UDDS cycle, tests it using combined cycles UDDS-WLTC-NEDC, and compares it with traditional ECM-based EMS. The results demonstrate that under the combined cycle, the strategy effectively reduced FCS voltage degradation by 50%, maintained fuel economy, and ensured consistency between the initial and final state of charge (SOC) of LIB.
energy & fuels
What problem does this paper attempt to address?
The paper attempts to address the issues of the service life and fuel consumption of Fuel Cell Systems (FCS), which are the main obstacles to the commercialization of Fuel Cell Electric Vehicles (FCEV). An effective Energy Management Strategy (EMS) can reduce fuel consumption during the cycling process and extend the service life of the FCS. This paper proposes an EMS based on a Deep Reinforcement Learning (DRL) algorithm, specifically Deep Q-Learning (DQL). Considering the unstable performance of traditional DQL during the training process, a new algorithm—Double Deep Q-Learning (DDQL)—is introduced. DDQL improves the convergence and stability of DRL by using a target evaluation network to assess output actions and delaying policy updates. Researchers trained the strategy using the UDDS cycle, tested it with a combined cycle of UDDS-WLTC-NEDC, and compared it with traditional ECM-based EMS. Experimental results show that under the combined cycle, the strategy effectively reduces FCS voltage degradation by 50%, maintains fuel economy, and ensures the consistency of the initial and final State of Charge (SOC) of the lithium-ion battery.