Abstract: In the context of autonomous navigation of terrestrial robots, the creation of realistic models for agent dynamics and sensing is a widespread habit in the robotics literature and in commercial applications, where they are used for model-based control and/or for localization and mapping. The more recent Embodied AI literature, on the other hand, focuses on modular or end-to-end agents trained in simulators like Habitat or AI-Thor, where the emphasis is put on photo-realistic rendering and scene diversity, but high-fidelity robot motion is assigned a less privileged role. The resulting sim2real gap significantly impacts transfer of the trained models to real robotic platforms. In this work we explore end-to-end training of agents in simulation in settings which minimize the sim2real gap both in sensing and in actuation. Our agent directly predicts (discretized) velocity commands, which are maintained through closed-loop control in the real robot. The behavior of the real robot (including the underlying low-level controller) is identified and simulated in a modified Habitat simulator. Noise models for odometry and localization further contribute to lowering the sim2real gap. We evaluate on real navigation scenarios, explore different localization and point goal calculation methods and report significant gains in performance and robustness compared to prior work.

What problem does this paper attempt to address?

The paper addresses efficient and accurate autonomous navigation of ground robots in real environments. Specifically, it focuses on reducing the gap between simulation and reality (the sim2real gap) when agents are trained end-to-end in a simulator, thereby improving the performance and robustness of the trained model on the real robot platform. Existing simulators often overlook the physical characteristics of robot motion, so models trained in them perform poorly in practice. The proposed method adds a high-fidelity robot dynamics model and noise models to the simulator, so that the trained agent adapts better to the sensing and actuation challenges of the real environment.

### Key Point Summary:

1. **Reducing the sim2real Gap**: Introducing a more realistic robot dynamics model in the simulator reduces the gap between simulation and reality and improves the model's performance on the actual robot.
2. **High-Fidelity Dynamics Model**: A second-order dynamics model simulates the robot's motion, including the behavior of its low-level closed-loop controller, which helps the agent predict future states.
3. **Noise Model**: Odometry and localization noise models further reduce the sim2real gap.
4. **End-to-End Training**: The agent is trained directly in the simulator to predict discretized velocity commands, which are executed on the actual robot through closed-loop control.
5. **Experimental Verification**: Different localization methods and point goal calculation methods are evaluated in real navigation scenarios, showing a significant performance improvement over previous work.

### Main Contributions of the Paper:

- A new method is proposed to reduce the sim2real gap by introducing high-fidelity dynamics and noise models in the simulator.
- It is shown that, through end-to-end training, the agent can achieve efficient and robust navigation on the real robot platform.
- The effectiveness of the method is verified through extensive experiments, especially the performance improvement in the real world.

### Formula Explanation:

- **Dynamics Model**:
  - State Update Equation:

\[
\begin{aligned}
\dot{v}_{t,\tau+1} &= \dot{v}_{t,\tau} + \Delta\phi \left( f_v \delta_v - 2\zeta_v f_v \dot{v}_{t,\tau} \right) \\
\dot{w}_{t,\tau+1} &= \dot{w}_{t,\tau} + \Delta\phi \left( f_w \delta_w - 2\zeta_w f_w \dot{w}_{t,\tau} \right) \\
v_{t,\tau+1} &= v_{t,\tau} + \Delta\phi \, \dot{v}_{t,\tau+1} \\
w_{t,\tau+1} &= w_{t,\tau} + \Delta\phi \, \dot{w}_{t,\tau+1} \\
\theta_{t,\tau+1} &= \theta_{t,\tau} + \Delta\phi \, w_{t,\tau+1} \\
x_{t,\tau+1} &= x_{t,\tau} + \Delta\phi \, v_{t,\tau+1} \cos(\theta_{t,\tau+1}) \\
y_{t,\tau+1} &= y_{t,\tau} + \Delta\phi \, v_{t,\tau+1} \sin(\theta_{t,\tau+1})
\end{aligned}
\]

  - Selection of Natural Frequency and Damping Coefficient:

\[
\begin{aligned}
\delta_v &= v^* - v \\
\langle f_v, \zeta_v \rangle &= \begin{cases} \dots \end{cases}
\end{aligned}
\]
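
The state update equation can be illustrated with a short Python sketch that applies it term by term. This is only a reading of the formula as summarized here, not the authors' implementation. Several things are assumptions: the names `RobotState`, `substep`, and `simulate` are hypothetical; the angular error is taken as \(\delta_w = w^* - w\) by analogy with \(\delta_v = v^* - v\); and the values of \(f\), \(\zeta\), and \(\Delta\phi\) are placeholders, since the rule for selecting them is not available in this summary.

```python
import math
from dataclasses import dataclass


@dataclass
class RobotState:
    """Hypothetical container for the quantities in the state update equation."""
    x: float = 0.0
    y: float = 0.0
    theta: float = 0.0
    v: float = 0.0      # linear velocity
    w: float = 0.0      # angular velocity
    v_dot: float = 0.0  # linear acceleration
    w_dot: float = 0.0  # angular acceleration


def substep(s, v_star, w_star, f_v, zeta_v, f_w, zeta_w, dphi):
    """One sub-step (tau -> tau + 1) of the update, following the equation order."""
    delta_v = v_star - s.v  # from the summary: delta_v = v* - v
    delta_w = w_star - s.w  # assumption: defined analogously to delta_v
    v_dot = s.v_dot + dphi * (f_v * delta_v - 2.0 * zeta_v * f_v * s.v_dot)
    w_dot = s.w_dot + dphi * (f_w * delta_w - 2.0 * zeta_w * f_w * s.w_dot)
    v = s.v + dphi * v_dot
    w = s.w + dphi * w_dot
    theta = s.theta + dphi * w
    x = s.x + dphi * v * math.cos(theta)
    y = s.y + dphi * v * math.sin(theta)
    return RobotState(x, y, theta, v, w, v_dot, w_dot)


def simulate(s, v_star, w_star, n_substeps,
             f_v=2.0, zeta_v=1.0, f_w=2.0, zeta_w=1.0, dphi=0.1):
    """Apply n_substeps updates with a fixed command (parameter values are placeholders)."""
    for _ in range(n_substeps):
        s = substep(s, v_star, w_star, f_v, zeta_v, f_w, zeta_w, dphi)
    return s


if __name__ == "__main__":
    final = simulate(RobotState(), v_star=0.5, w_star=0.0, n_substeps=50)
    print(f"v={final.v:.3f} x={final.x:.3f} y={final.y:.3f}")
```

With a constant commanded velocity, the simulated velocity approaches the command gradually instead of jumping to it. That lag is the low-level controller response the simulator is meant to reproduce.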

Learning to navigate efficiently and precisely in real environments
