Learning to navigate efficiently and precisely in real environments

Guillaume Bono, Hervé Poirier, Leonid Antsfeld, Gianluca Monaci, Boris Chidlovskii, Christian Wolf
2024-01-26
Abstract: In the context of autonomous navigation of terrestrial robots, the creation of realistic models for agent dynamics and sensing is a widespread habit in the robotics literature and in commercial applications, where they are used for model-based control and/or for localization and mapping. The more recent Embodied AI literature, on the other hand, focuses on modular or end-to-end agents trained in simulators like Habitat or AI-Thor, where the emphasis is put on photo-realistic rendering and scene diversity, but high-fidelity robot motion is assigned a less privileged role. The resulting sim2real gap significantly impacts transfer of the trained models to real robotic platforms. In this work we explore end-to-end training of agents in simulation in settings which minimize the sim2real gap both in sensing and in actuation. Our agent directly predicts (discretized) velocity commands, which are maintained through closed-loop control in the real robot. The behavior of the real robot (including the underlying low-level controller) is identified and simulated in a modified Habitat simulator. Noise models for odometry and localization further contribute to lowering the sim2real gap. We evaluate on real navigation scenarios, explore different localization and point goal calculation methods and report significant gains in performance and robustness compared to prior work.
Robotics, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve efficient and accurate autonomous navigation of ground robots in practical environments. Specifically, the paper focuses on how to reduce the gap between simulation and reality (sim2real gap) through end-to-end training in the simulator, thereby improving the performance and robustness of the trained model on the real-robot platform. Traditional simulation methods often overlook the real physical characteristics of robot motion, resulting in poor performance of the trained model in practical applications. The method proposed in this paper enables the trained agent to better adapt to the perception and execution challenges in the real environment by introducing a high-fidelity robot dynamics model and a noise model in the simulator.

### Key Point Summary:

1. **Reducing the sim2real Gap**: By introducing a more realistic robot dynamics model in the simulator, the gap between simulation and reality is reduced, and the performance of the model on the actual robot is improved.
2. **High-Fidelity Dynamics Model**: A second-order dynamics model is used to simulate the motion behavior of the robot, including the behavior of its low-level closed-loop controller, which helps the agent predict future states.
3. **Noise Model**: The introduction of odometry and localization noise models further reduces the sim2real gap.
4. **End-to-End Training**: The agent is directly trained in the simulator so that it can predict discretized velocity commands and execute these commands on the actual robot through closed-loop control.
5. **Experimental Verification**: The effects of different localization methods and target point calculation methods are evaluated in actual navigation scenarios, demonstrating a significant performance improvement compared to previous work.
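
Key points 2 and 4 can be made concrete with a small simulation. The following is a minimal sketch, not the authors' implementation: it steps a second-order velocity model of the kind described above, following the state update equations listed under "Formula Explanation" term by term. Several things are assumptions made only for illustration: the `ACTIONS` table, the names `RobotState`, `substep` and `simulate`, the sub-step size `dphi`, and the constant values of `f_v`, `zeta_v`, `f_w` and `zeta_w` (the source selects the frequency and damping by a case distinction that is not reproduced here). The angular error `delta_w` is defined by analogy with `delta_v`.

```python
import math
from dataclasses import dataclass


@dataclass
class RobotState:
    x: float = 0.0      # position (m)
    y: float = 0.0
    theta: float = 0.0  # heading (rad)
    v: float = 0.0      # linear velocity
    w: float = 0.0      # angular velocity
    v_dot: float = 0.0  # linear acceleration
    w_dot: float = 0.0  # angular acceleration


# Hypothetical discrete action set: index -> (v*, w*) velocity targets.
ACTIONS = [(0.0, 0.0), (0.3, 0.0), (0.0, 0.5), (0.0, -0.5)]


def substep(s, v_target, w_target, dphi, f_v, zeta_v, f_w, zeta_w):
    """One physical sub-step of the second-order velocity model."""
    delta_v = v_target - s.v
    delta_w = w_target - s.w  # assumed analogous to delta_v
    # Accelerations are updated first, then velocities, heading and position,
    # each using the freshly updated quantity from the line above it.
    v_dot = s.v_dot + dphi * (f_v * delta_v - 2.0 * zeta_v * f_v * s.v_dot)
    w_dot = s.w_dot + dphi * (f_w * delta_w - 2.0 * zeta_w * f_w * s.w_dot)
    v = s.v + dphi * v_dot
    w = s.w + dphi * w_dot
    theta = s.theta + dphi * w
    x = s.x + dphi * v * math.cos(theta)
    y = s.y + dphi * v * math.sin(theta)
    return RobotState(x, y, theta, v, w, v_dot, w_dot)


def simulate(s, action, n_substeps, dphi=0.01,
             f_v=4.0, zeta_v=1.0, f_w=4.0, zeta_w=1.0):
    """Hold one discrete action for n_substeps sub-steps (placeholder gains)."""
    v_target, w_target = ACTIONS[action]
    for _ in range(n_substeps):
        s = substep(s, v_target, w_target, dphi, f_v, zeta_v, f_w, zeta_w)
    return s


if __name__ == "__main__":
    # Command "forward" from rest and inspect the state after 2000 sub-steps.
    print(simulate(RobotState(), action=1, n_substeps=2000))
```

With these placeholder gains the velocity approaches the commanded target gradually instead of jumping to it, which is the behavior an agent has to anticipate when its commands are tracked by a low-level controller.
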
### Main Contributions of the Paper:

- A new method is proposed to reduce the sim2real gap by introducing high-fidelity dynamics and noise models in the simulator.
- It is shown that through end-to-end training, the agent can achieve efficient and robust navigation on the real-robot platform.
- The effectiveness of the method is verified through extensive experiments, especially the performance improvement in the real world.

### Formula Explanation:

- **Dynamics Model**:
  - State Update Equation:
    \[
    \begin{aligned}
    \dot{v}_{t,\tau+1} &= \dot{v}_{t,\tau} + \Delta\phi \left( f_v \delta_v - 2\zeta_v f_v \dot{v}_{t,\tau} \right) \\
    \dot{w}_{t,\tau+1} &= \dot{w}_{t,\tau} + \Delta\phi \left( f_w \delta_w - 2\zeta_w f_w \dot{w}_{t,\tau} \right) \\
    v_{t,\tau+1} &= v_{t,\tau} + \Delta\phi \dot{v}_{t,\tau+1} \\
    w_{t,\tau+1} &= w_{t,\tau} + \Delta\phi \dot{w}_{t,\tau+1} \\
    \theta_{t,\tau+1} &= \theta_{t,\tau} + \Delta\phi w_{t,\tau+1} \\
    x_{t,\tau+1} &= x_{t,\tau} + \Delta\phi v_{t,\tau+1} \cos(\theta_{t,\tau+1}) \\
    y_{t,\tau+1} &= y_{t,\tau} + \Delta\phi v_{t,\tau+1} \sin(\theta_{t,\tau+1})
    \end{aligned}
    \]
  - Selection of Natural Frequency and Damping Coefficient:
    \[
    \begin{aligned}
    \delta_v &= v^* - v \\
    \langle f_v, \zeta_v \rangle &= \begin{cases}