Output-Sampled Model Predictive Path Integral Control (o-MPPI) for Increased Efficiency

Leon,Santosh Devasia
2023-09-23
Abstract:The success of the model predictive path integral control (MPPI) approach depends on the appropriate selection of the input distribution used for sampling. However, it can be challenging to select inputs that satisfy output constraints in dynamic environments. The main contribution of this paper is to propose an output-sampling-based MPPI (o-MPPI), which improves the ability of samples to satisfy output constraints and thereby increases MPPI efficiency. Comparative simulations and experiments of dynamic autonomous driving of bots around a track are provided to show that the proposed o-MPPI is more efficient and requires substantially (20-times) less number of rollouts and (4-times) smaller prediction horizon when compared with the standard MPPI for similar success rates. The supporting video for the paper can be found at <a class="link-external link-https" href="https://youtu.be/snhlZj3l5CE" rel="external noopener nofollow">this https URL</a>.
Systems and Control
What problem does this paper attempt to address?
The paper mainly addresses the issue of the Model Predictive Path Integral Control (MPPI) method's difficulty in effectively meeting output constraints in dynamic environments and proposes an improvement. Specifically, the main contributions of the paper are as follows: 1. **Proposed Output Sampling MPPI (o-MPPI)**: Traditional MPPI methods sample through input distributions to predict system behavior, but this approach struggles to ensure that outputs meet specific constraints. Therefore, the paper proposes the o-MPPI method, which directly samples trajectories from the output space and uses an inverse model to map these outputs back to the input space. This method improves the ability of sampled trajectories to meet output constraints, thereby enhancing the overall efficiency of MPPI. 2. **Improved Efficiency**: Through comparative experiments, it is demonstrated that o-MPPI requires 20 times fewer samples and reduces the prediction time window by 4 times compared to standard MPPI while achieving similar success rates. 3. **Simulation and Experimental Validation**: The paper validates the effectiveness of o-MPPI in autonomous driving scenarios through simulations and experiments, particularly in the presence of moving obstacles. Experimental results show that o-MPPI can more efficiently avoid obstacles and successfully overtake slow-moving vehicles. In summary, the goal of the paper is to improve the performance of the MPPI algorithm in handling output constraints in dynamic environments. By introducing the concept of output space sampling and the application of inverse models, this goal is achieved. This method not only improves sampling efficiency but also enables MPPI to better adapt to complex dynamic environments.