Abstract:The path tracking control system is a crucial component for autonomous vehicles; it is challenging to realize accurate tracking control when approaching a wide range of uncertain situations and dynamic environments, particularly when such control must perform as well as, or better than, human drivers. While many methods provide state-of-the-art tracking performance, they tend to emphasize constant PID control parameters, calibrated by human experience, to improve tracking accuracy. A detailed analysis shows that PID controllers inefficiently reduce the lateral error under various conditions, such as complex trajectories and variable speed. In addition, intelligent driving vehicles are highly non-linear objects, and high-fidelity models are unavailable in most autonomous systems. As for the model-based controller (MPC or LQR), the complex modeling process may increase the computational burden. With that in mind, a self-optimizing, path tracking controller structure, based on reinforcement learning, is proposed. For the lateral control of the vehicle, a steering method based on the fusion of the reinforcement learning and traditional PID controllers is designed to adapt to various tracking scenarios. According to the pre-defined path geometry and the real-time status of the vehicle, the interactive learning mechanism, based on an RL framework (actor–critic—a symmetric network structure), can realize the online optimization of PID control parameters in order to better deal with the tracking error under complex trajectories and dynamic changes of vehicle model parameters. The adaptive performance of velocity changes was also considered in the tracking process. The proposed controlling approach was tested in different path tracking scenarios, both the driving simulator platforms and on-site vehicle experiments have verified the effects of our proposed self-optimizing controller. The results show that the approach can adaptively change the weights of PID to maintain a tracking error (simulation: within ±0.071 m; realistic vehicle: within ±0.272 m) and steering wheel vibration standard deviations (simulation: within ±0.04°; realistic vehicle: within ±80.69°); additionally, it can adapt to high-speed simulation scenarios (the maximum speed is above 100 km/h and the average speed through curves is 63–76 km/h).

Optimal Lateral Path-Tracking Control of Vehicles with Partial Unknown Dynamics Via DPG-Based Reinforcement Learning Methods

Learning-Based Hierarchical Model Predictive Control for Drift Vehicles

Path Following for Autonomous Ground Vehicle Using DDPG Algorithm: A Reinforcement Learning Approach

Adaptive Learning-Based Path-Tracking Control for Unknown Vehicle Systems under Performance Optimization

Cooperative Path Following Control in Autonomous Vehicles Graphical Games: A Data-Based Off-Policy Learning Approach

Learning-Based MPC Controller for Drift Control of Autonomous Vehicles

Perceptual Interaction-Based Path Tracking Control of Autonomous Vehicles under DoS Attacks: A Reinforcement Learning Approach

Path Tracking Control of Autonomous Ground Vehicles Via Model Predictive Control and Deep Deterministic Policy Gradient Algorithm

Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS with Unidentified Exosystem Dynamics.

A Novel Vehicle Platoon Following Controller Based on Deep Deterministic Policy Gradient Algorithms

Self-Optimizing Path Tracking Controller for Intelligent Vehicles Based on Reinforcement Learning

Distributed Drive Autonomous Vehicle Trajectory Tracking Control Based on Multi-Agent Deep Reinforcement Learning

Research on Path Tracking Control Based on Optimal Look-Ahead Points

Parallel Cross Entropy Policy Gradient Adaptive Dynamic Programming for Optimal Tracking Control of Discrete-Time Nonlinear Systems

Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-minimum Phase Systems

Continuous‐time receding‐horizon reinforcement learning and its application to path‐tracking control of autonomous ground vehicles

Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy Gradients

A Novel Robust H∞ Control Approach Based on Vehicle Lateral Dynamics for Practical Path Tracking Applications

Longitudinal robust dynamic programming control for driving robot vehicles with performance self-learning

Trajectory tracking control of wheeled mobile robot based on improved LSTM-DDPG algorithm

Deep reinforcement learning-based drift parking control of automated vehicles