Learning Model Predictive Control for Connected Autonomous Vehicles

Hassan Jafarzadeh,Cody Fleming
DOI: https://doi.org/10.48550/arXiv.1908.02879
2019-08-08
Abstract:A Learning Model Predictive Controller (LMPC) is presented and tailored to platooning and Connected Autonomous Vehicles (CAVs) applications. The proposed controller builds on previous work on nonlinear LMPC, adapting its architecture and extending its capability to (a) handle dynamic environments and (b) account for data-driven decision variables that derive from an unknown or unknowable function. The paper presents the control design approach, and shows how to recursively construct an outer loop candidate trajectory and an inner iterative LMPC controller that converges to an optimal strategy over both model-driven and data-driven variables. Simulation results show the effectiveness of the proposed control logic.
Optimization and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to design a Model Predictive Control (MPC) that can handle dynamic environments and uncertain communication channels in vehicle platooning and Connected Autonomous Vehicles (CAVs). Specifically, the paper proposes a Learning Model Predictive Control (LMPC) method, aiming at: 1. **Handling dynamic environments**: Traditional MPC usually assumes that the environment is static, but in practical applications, the environment around the vehicle is constantly changing. The method proposed in the paper can adapt to these dynamic changes, such as the movement of other vehicles, changes in road conditions, etc. 2. **Considering data - driven decision variables**: In many cases, some variables or objective functions are generated by unknown or unknowable functions, such as variables generated by artificial neural networks (ANN). The method proposed in the paper can handle these data - driven variables and incorporate them into the control strategy. 3. **Dealing with uncertain communication channels**: In vehicle platooning, the quality of wireless communication is affected by multiple factors, such as multipath reflection, building occlusion, etc. The method proposed in the paper optimizes the vehicle's motion strategy by predicting the quality of the communication channel to improve the overall performance and safety of the system. ### Main contributions of the paper 1. **Proposing a new LMPC problem**: This problem can handle decision variables or objective functions generated by unknown or unknowable functions, such as variables generated by artificial neural networks. 2. **Extending LMPC**: Making it able to handle dynamic environments and time - evolving constraints while maintaining computational feasibility. ### Specific implementation - **Short - Range LMPC (SR - LMPC)**: By shortening the planning time horizon, the algorithm can converge to the optimal solution more quickly in a dynamic environment. SR - LMPC performs multiple inner - loop iterations in each outer - loop iteration, and each iteration uses data - driven decision variables to improve the current trajectory. - **Considering uncertain communication channels**: By introducing communication delay as part of the optimization objective, the trajectory generated by the model is forced to maintain high communication quality. In addition, dynamic constraints are defined to represent the areas where communication loss occurs, and these constraints are updated in each inner - loop iteration. - **Converting Mixed - Integer Nonlinear Programming (MINLP) to Nonlinear Programming (NLP)**: By converting the constraints of binary variables to nonlinear constraints, the solution process becomes more efficient. ### Application scenarios The paper verifies the effectiveness of the proposed method through a specific scenario: in vehicle platooning, the front vehicle and the rear vehicle communicate through the V2V network, but the communication performance is negatively affected by the bridge passage. In this case, the rear vehicle needs to adjust its longitudinal speed according to the state of the front vehicle and the communication quality to avoid communication interruption. The experimental results show that the proposed SR - LMPC method can effectively deal with this uncertain communication environment and improve the overall performance and safety of vehicle platooning. ### Conclusion The Learning Model Predictive Control method (LMPC) proposed in the paper performs excellently in handling dynamic environments and uncertain communication channels and can significantly improve the performance and safety of Connected Autonomous Vehicles (CAVs) in vehicle platooning.