Real-Time Energy Management for Hev Combining Naturalistic Driving Data and Deep Reinforcement Learning with High Generalization

Zemin Eitan Liu,Yong Li,Quan Zhou,Bin Shuai,Min Hua,Hongming Xu,Lubing Xu,Guikun Tan,Yanfei Li
DOI: https://doi.org/10.1016/j.apenergy.2024.124350
IF: 11.2
2024-01-01
Applied Energy
Abstract:Generalization to unseen environments is still a challenge for deep reinforcement learning (DRL)-based energy management strategies (EMSs). This paper proposes a real-time EMS with high generalization for a light-duty hybrid electric vehicle (HEV) from two perspectives: enhancing the generalization of the DRL algorithm and improving the accuracy of application scenario representation in the training environment. The enhanced DRL algorithm named ATSAC can adjust the update frequency and learning rate of SAC automatically to improve the generalization. With the advancement of naturalistic driving big data (NDBD) and machine learning, a specific training cycle is synthesized based on NDBD to reflect an urban-suburban real-world driving scenario more accurately. By the comprehensive comparison with SAC and TD3 based EMSs applied to unseen driving scenarios, the proposed algorithm achieves significant improvement in computational efficiency, optimality, and generalization. The results show that the computational efficiency of ATSAC is increased by 52.32% compared to SAC. The negative total reward (NTR) of ATSAC is decreased by 18.22% and 69.81% compared to SAC and TD3, respectively. Further analysis shows that the EMS trained through the synthetic driving cycle obtains 18.37% lower NTR than WLTC which demonstrates that the synthetic method can reflect the state transition probability of real-world driving scenarios better than WLTC.
What problem does this paper attempt to address?