Abstract:Omnidirectional image (ODI) data is captured with a 360x180 field-of-view, which is much wider than the pinhole cameras and contains richer spatial information than the conventional planar images. Accordingly, omnidirectional vision has attracted booming attention due to its more advantageous performance in numerous applications, such as autonomous driving and virtual reality. In recent years, the availability of customer-level 360 cameras has made omnidirectional vision more popular, and the advance of deep learning (DL) has significantly sparked its research and applications. This paper presents a systematic and comprehensive review and analysis of the recent progress in DL methods for omnidirectional vision. Our work covers four main contents: (i) An introduction to the principle of omnidirectional imaging, the convolution methods on the ODI, and datasets to highlight the differences and difficulties compared with the 2D planar image data; (ii) A structural and hierarchical taxonomy of the DL methods for omnidirectional vision; (iii) A summarization of the latest novel learning strategies and applications; (iv) An insightful discussion of the challenges and open problems by highlighting the potential research directions to trigger more research in the community.

What problem does this paper attempt to address?

The paper attempts to address the challenges of omnidirectional vision in applications, particularly how to utilize deep learning (DL) techniques to process omnidirectional image (ODI) data. Specifically, the paper focuses on the following aspects: 1. **Special Properties of Omnidirectional Images**: Omnidirectional images have a 360°×180° field of view, capturing more information than traditional pinhole cameras, but also introducing severe distortion and content discontinuity issues. 2. **Application of Deep Learning Methods**: In recent years, with the proliferation of consumer-grade 360° cameras and the development of deep learning technologies, research and applications in omnidirectional vision have been significantly promoted. The paper systematically reviews and analyzes the latest advancements in deep learning within the field of omnidirectional vision. 3. **Datasets and Convolution Methods**: The paper introduces the imaging principles of omnidirectional images, convolution methods, and commonly used datasets, discussing the differences and challenges of these datasets compared to traditional 2D planar image data. 4. **Classification and Hierarchical Structure**: The paper proposes a structured classification system covering various deep learning methods in omnidirectional vision, including convolution filters, network design, novel learning strategies, and practical applications. 5. **Latest Research and Future Directions**: The paper summarizes the latest learning strategies and potential applications, delves into the challenges and unresolved issues in current research, and proposes future research directions. Overall, the paper aims to provide a comprehensive review for researchers in the field of omnidirectional vision, helping them better understand and address the challenges in this area.

Deep Learning for Omnidirectional Vision: A Survey and New Perspectives

Applications of Deep Learning for Top-View Omnidirectional Imaging: A Survey

Unwrapping and Stereo Rectification for Omnidirectional Images

Omnisupervised Omnidirectional Semantic Segmentation

Semantic Visual Odometry Based on Panoramic Annular Imaging

Real-time Omnidirectional Depth Perception Based on Multi-view Wide-angle Vision System

Deep Learning in Visual Tracking: A Review

Approaches, Challenges, and Applications for Deep Visual Odometry: Toward Complicated and Emerging Areas

Approaches, Challenges, and Applications for Deep Visual Odometry: Toward to Complicated and Emerging Areas

A Survey on Monocular 3D Object Detection Algorithms Based on Deep Learning

Deep learning-based perception systems for autonomous driving: A comprehensive survey

Multi-view stereo in the Deep Learning Era: A comprehensive review

Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey

Multi-view stereo in the Deep Learning Era: A comprehensive revfiew

Omni-Directional Image Generation from Single Snapshot Image

Deep Learning Meets OBIA: Tasks, Challenges, Strategies, and Perspectives

Deep Learning on Monocular Object Pose Detection and Tracking: A Comprehensive Overview

Deep Learning Meets Object-Based Image Analysis: Tasks, Challenges, Strategies, and Perspectives

Deep Learning for Unmanned Aerial Vehicle-Based Object Detection and Tracking: A Survey