Abstract:Event cameras are bio-inspired sensors that capture the per-pixel intensity changes asynchronously and produce event streams encoding the time, pixel position, and polarity (sign) of the intensity changes. Event cameras possess a myriad of advantages over canonical frame-based cameras, such as high temporal resolution, high dynamic range, low latency, etc. Being capable of capturing information in challenging visual conditions, event cameras have the potential to overcome the limitations of frame-based cameras in the computer vision and robotics community. In very recent years, deep learning (DL) has been brought to this emerging field and inspired active research endeavors in mining its potential. However, there is still a lack of taxonomies in DL techniques for event-based vision. We first scrutinize the typical event representations with quality enhancement methods as they play a pivotal role as inputs to the DL models. We then provide a comprehensive survey of existing DL-based methods by structurally grouping them into two major categories: 1) image/video reconstruction and restoration; 2) event-based scene understanding and 3D vision. We conduct benchmark experiments for the existing methods in some representative research directions, i.e., image reconstruction, deblurring, and object recognition, to identify some critical insights and problems. Finally, we have discussions regarding the challenges and provide new perspectives for inspiring more research studies.

What problem does this paper attempt to address?

This paper focuses on the application of event cameras and deep learning in the fields of computer vision and robotics. Event cameras are a new type of sensor that can capture pixel-level brightness changes asynchronously, generating an event stream with time, position, and polarity encoding. Compared to traditional frame-based cameras, event cameras have advantages such as high temporal resolution, high dynamic range, and low latency, making them especially suitable for acquiring information under extreme lighting or high-speed motion conditions. In recent years, deep learning has been introduced into the research of event cameras, which has stimulated a large amount of research work on exploring their potential. However, there is currently a lack of systematic classification and investigation of deep learning techniques for event cameras. This paper first analyzes the typical representation methods of event data and their quality enhancement techniques, and then comprehensively reviews the existing deep learning methods, categorizing them into two main categories: image/video reconstruction and restoration, as well as event-driven scene understanding and 3D vision. The authors conducted benchmark experiments to identify key insights and issues, and discussed the future directions of research. The main contributions of the paper include providing a comprehensive overview of event data representation and quality enhancement, summarizing the applications of existing deep learning methods in event visual tasks, discussing challenges and proposing future research directions. It also created an open-source repository containing all mentioned papers and code links for tracking the latest research progress. In summary, this paper aims to fill the gap in systematic research on the integration of event cameras and deep learning applications, providing references and guidance for researchers in related fields.

Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks

A Review of Image Reconstruction Based on Event Cameras

Event-based Stereo Depth Estimation: A Survey

Recent Event Camera Innovations: A Survey

A Survey on Deep Learning Event Extraction: Approaches and Applications

Event-based Simultaneous Localization and Mapping: A Comprehensive Survey

Recent Advances in Bio-Inspired Vision Sensor: A Review

Event-Based Vision Enhanced: A Joint Detection Framework in Autonomous Driving

Deep Learning in Visual Tracking: A Review

E2HQV: High-Quality Video Generation from Event Camera via Theory-Inspired Model-Aided Deep Learning

Deep Event Visual Odometry

Event-Based Low-Illumination Image Enhancement

EventAid: Benchmarking Event-aided Image/Video Enhancement Algorithms with Real-captured Hybrid Dataset

Deep Learning for Visual Tracking: A Comprehensive Survey

EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction

A Survey on Deep Learning Methods for Robot Vision

VESS - Variable Event Stream Structure for Event-based Instance Segmentation Benchmark.

Deep Representation Via Convolutional Neural Network for Classification of Spatiotemporal Event Streams

Deblurring Low-Light Images with Events

Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline

Deep Learning for Camera Calibration and Beyond: A Survey