Abstract: The energy management strategy (EMS) plays a vital role in the energy efficiency of extended range electric vehicles. However, some modern strategies, such as model predictive control (MPC) and dynamic programming (DP), have limited practical potential because they depend on environment information known in advance and are sensitive to noise interference. A reinforcement learning (RL) control strategy can instead be adopted as an online controller that interacts with the vehicle and the environment. In this study, a novel auxiliary power unit (APU) charging strategy with multi-objective optimization is proposed to achieve high fuel conversion efficiency while maintaining battery charging health. The state-of-the-art Soft Actor-Critic (SAC) algorithm is applied to achieve better exploration of the possible APU behaviour and to address the sensitivity and poor convergence problems of current RL studies. Its performance is further verified against the results of the Deep Deterministic Policy Gradient (DDPG) algorithm and DP. Three innovative targets are selected as the RL rewards for optimization: the engine fuel rate, the state of charge (SOC) charging trajectory, and the battery charging rate (C-rate). The first adoption of battery C-rate monitoring in an RL-based energy management strategy helps extend battery lifespan by guarding against excessive discharge. The comparative results show that SAC converges 36% faster than DDPG while providing a smoother and more stable action space. SAC also outperforms DDPG in fuel consumption by around 3%, achieving almost 95% of the global optimization result. The successful deployment of the SAC algorithm as an EMS indicates its standout ability in dealing with wide-ranging actions and states with high randomness, revealing its practical potential compared with existing RL strategies.
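
As a rough illustration of how the three reward targets named in the abstract (engine fuel rate, SOC trajectory, and battery C-rate) might be combined into one scalar RL reward, the sketch below uses a weighted sum of penalties. The abstract does not give the reward formula, so the function name `apu_reward`, the `RewardWeights` class, the quadratic penalty shapes, the weights, and the C-rate limit are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RewardWeights:
    # Hypothetical weights; the abstract does not state any values.
    fuel: float = 1.0
    soc: float = 1.0
    c_rate: float = 1.0


def apu_reward(
    fuel_rate: float,
    soc: float,
    soc_ref: float,
    c_rate: float,
    c_rate_limit: float,
    weights: Optional[RewardWeights] = None,
) -> float:
    """Return a negative weighted cost over the three targets (assumed form)."""
    w = weights if weights is not None else RewardWeights()
    # Target 1: engine fuel rate, penalized directly.
    fuel_cost = fuel_rate
    # Target 2: squared deviation of SOC from an assumed reference trajectory.
    soc_cost = (soc - soc_ref) ** 2
    # Target 3: penalize only the part of |C-rate| above the assumed limit.
    c_rate_cost = max(0.0, abs(c_rate) - c_rate_limit) ** 2
    return -(w.fuel * fuel_cost + w.soc * soc_cost + w.c_rate * c_rate_cost)
```

With this shape, an agent is rewarded less for burning more fuel, for drifting from the reference SOC, or for exceeding the assumed C-rate limit; the relative weights would need tuning for any real vehicle model.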

Energy management strategy via maximum entropy reinforcement learning for an extended range logistics vehicle