Abstract:Safety is a critical issue in learning-based robotic and autonomous systems as learned information about their environments is often unreliable and inaccurate. In this paper, we propose a risk-aware motion control tool that is robust against errors in learned distributional information about obstacles moving with unknown dynamics. The salient feature of our model predictive control (MPC) method is its capability of limiting the risk of unsafety even when the true distribution deviates from the distribution estimated by Gaussian process (GP) regression, within an ambiguity set. Unfortunately, the distributionally robust MPC problem with GP is intractable because the worst-case risk constraint involves an infinite-dimensional optimization problem over the ambiguity set. To remove the infinite-dimensionality issue, we develop a systematic reformulation approach exploiting modern distributionally robust optimization techniques. The performance and utility of our method are demonstrated through simulations using a nonlinear car-like vehicle model for autonomous driving.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in mobile robots and autonomous systems, due to the uncertainty and dynamics of environmental information, learning - based decision - making tools often lead to unreliable and inaccurate learned information, which may cause safety problems during the control process. Specifically, the paper focuses on developing an optimization method to achieve robust control of moving obstacles and ensure safety even when there are errors in the learned obstacle motion information. The paper proposes a method based on model predictive control (MPC), which can limit safety risks when there are deviations between the estimated probability distribution and the real distribution, especially in the case of obstacle motion with unknown dynamics.
### Main contributions of the paper
1. **Enhanced robustness**: By introducing distributionally robust optimization techniques, the method proposed in the paper can still ensure the safety of the system when there are errors in the learned distribution information.
2. **Solution to infinite - dimensional optimization problems**: The paper proposes a systematic reformulation method, which transforms the distributionally robust MPC problem originally involving infinite - dimensional optimization problems into a finite - dimensional optimization problem, making it possible to be efficiently solved using existing nonlinear programming algorithms.
3. **Verification in practical applications**: Through simulation experiments, the paper shows the performance of the proposed method in a nonlinear vehicle model, especially in the obstacle - avoidance competition, proving its effectiveness and robustness in practical applications.
### Key technical points
- **Gaussian process regression (GP)**: Used to estimate the probability distribution of the future positions of obstacles.
- **Conditional value - at - risk (CVaR)**: Used to quantify the risk beyond the confidence level, especially suitable for evaluating rare but unsafe events.
- **Wasserstein distance**: Used to define the ambiguity set of distributions, thereby constructing distributionally robust optimization problems.
### Experimental results
- **Safety**: In various situations, the method proposed in the paper can successfully avoid collisions, even when there are large errors in the learned obstacle motion information.
- **Performance comparison**: Compared with the non - robust version of MPC (SAA - MPC), the method proposed in the paper shows significant advantages in terms of safety. Although it may deviate far from the reference trajectory in some cases, it can generally complete the task without collisions.
- **Parameter selection**: Through experiments, the paper suggests choosing an appropriate Wasserstein ambiguity set radius (such as \( \theta = 5\times10^{-5} \)), in order to achieve a good balance between safety and cost.
In conclusion, by introducing distributionally robust optimization techniques, this paper solves the safety problems caused by learning errors in learning - based motion control, providing new ideas and methods for the safe control of mobile robots and autonomous systems.