Abstract:We present the first static-obstacle avoidance method for quadrotors using just an onboard, monocular event camera. Quadrotors are capable of fast and agile flight in cluttered environments when piloted manually, but vision-based autonomous flight in unknown environments is difficult in part due to the sensor limitations of traditional onboard cameras. Event cameras, however, promise nearly zero motion blur and high dynamic range, but produce a very large volume of events under significant ego-motion and further lack a continuous-time sensor model in simulation, making direct sim-to-real transfer not possible. By leveraging depth prediction as a pretext task in our learning framework, we can pre-train a reactive obstacle avoidance events-to-control policy with approximated, simulated events and then fine-tune the perception component with limited events-and-depth real-world data to achieve obstacle avoidance in indoor and outdoor settings. We demonstrate this across two quadrotor-event camera platforms in multiple settings and find, contrary to traditional vision-based works, that low speeds (1m/s) make the task harder and more prone to collisions, while high speeds (5m/s) result in better event-based depth estimation and avoidance. We also find that success rates in outdoor scenes can be significantly higher than in certain indoor scenes.

What problem does this paper attempt to address?

### Problems Addressed by the Paper This paper aims to address the problem of static obstacle avoidance for quadrotor drones during autonomous flight in unknown environments. Specifically, the authors propose a method for static obstacle avoidance using only a monocular event camera. Traditional cameras face significant challenges in autonomous flight in complex environments due to low frame rates, limited dynamic range, and motion blur. In contrast, event cameras offer advantages such as low latency and high dynamic range. However, the event stream output from event cameras generates a large amount of data during high-speed motion and lacks a continuous-time sensor model, making the transition from simulation to real-world application difficult. ### Main Contributions 1. **First proposed method for static obstacle avoidance based on event streams**: This is the first method to achieve static obstacle avoidance using a monocular event camera. 2. **Successful few-shot transfer from simulation to real-world application**: By utilizing depth prediction as a pre-training task, the event stream-based strategy effectively transfers from simulation to real-world application. 3. **High-speed (5m/s) obstacle avoidance in real environments**: Demonstrated that the drone can successfully avoid trees at high speeds (5m/s) using purely onboard computation, with improved obstacle avoidance performance as speed increases. 4. **Open-source code and data**: Provided open-source code and data for simulation, data collection, training, and testing. ### Method Overview 1. **Simulation setup and data collection**: Used the Flightmare simulator to generate grayscale and depth images, and converted these images to event streams using the Vid2E tool. 2. **Event data generation and representation**: Batch processed the event stream into binary event masks (BEM) within time windows to reduce the impact of discontinuities. 3. **Learning framework**: The perception module (D(θ)) uses a U-Net architecture to predict depth from BEM, and the velocity prediction module (V(Φ)) generates obstacle avoidance velocity commands based on the depth prediction results. 4. **Few-shot transfer and cross-platform adaptation**: Collected real-world event and depth image pairs to fine-tune the perception module, adapting it to different scenes and platforms. ### Experimental Results 1. **Simulation experiments**: Conducted 100 rolling trials in a simulated forest environment, showing a 60% success rate on medium-length trajectories (10m) and 15% of trials without collisions on long trajectories (60m). 2. **Hardware experiments**: Conducted experiments in indoor and outdoor environments using different event camera platforms. Results showed that the event stream-based obstacle avoidance method performed better at high speeds (5m/s) compared to traditional vision methods. More events generated at higher speeds improved depth prediction quality, enhancing obstacle avoidance performance. ### Conclusion This paper validates the feasibility of using only event cameras for static obstacle avoidance through simulation and real-world experiments. The study found that, unlike traditional vision methods, the event stream-based obstacle avoidance method performs better at high speeds because higher speeds generate more events, improving depth prediction quality. Additionally, the success rate in outdoor tests was higher than in indoor tests, possibly due to artificial light sources and background textures in indoor environments interfering with event pattern recognition.

Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor

Autonomous Flights in Dynamic Environments with Onboard Vision

Obstacle Avoidance of Monocular Quadrotors with Depth Estimation

Monocular Vision Based Obstacle Detection for Robot Navigation in Unstructured Environment

Dynamic obstacle avoidance for quadrotors with event cameras

Fusing Stereopsis & Corner Sparse Optical Flow for Real-time Obstacle Avoidance of Unmanned Aerial Vehicle

Monocular Depth Estimation for UAV Obstacle Avoidance

Autonomous quadrotor obstacle avoidance based on dueling double deep recurrent Q-learning with monocular vision

Monocular Vision-Based Obstacle Detection/Avoidance For Unmanned Aerial Vehicles

A real-time dynamic obstacle tracking and mapping system for UAV navigation and collision avoidance with an RGB-D camera

Seeing Through Pixel Motion: Learning Obstacle Avoidance from Optical Flow with One Camera

Real-time Quadrotor Navigation Through Planning in Depth Space in Unstructured Environments

Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance

Avoiding dynamic small obstacles with onboard sensing and computating on aerial robots

Event-Based Motion Capture System for Online Multi-Quadrotor Localization and Tracking

Visual path following with obstacle avoidance for quadcopters in indoor environments

Robust Vision-based Obstacle Avoidance for Micro Aerial Vehicles in Dynamic Environments

Real-Time Neuromorphic Navigation: Integrating Event-Based Vision and Physics-Driven Planning on a Parrot Bebop2 Quadrotor

Monocular Snapshot-based Sensing and Control of Hover, Takeoff, and Landing for a Low-cost Quadrotor

Monocular vision-based autonomous navigation system on a toy quadcopter in unknown environments

A Monocular Vision Obstacle Avoidance Method Applied to Indoor Tracking Robot