Remote Sensing Object Detection in the Deep Learning Era—A Review

Shengxi Gui, Shuang Song, Rongjun Qin, Yang Tang
DOI: https://doi.org/10.3390/rs16020327
IF: 5
2024-01-13
Remote Sensing
Abstract: Given the large volume of remote sensing images collected daily, automatic object detection and segmentation have been a consistent need in Earth observation (EO). However, objects of interest vary in shape, size, appearance, and reflecting properties. This is not only reflected by the fact that these objects exhibit differences due to their geographical diversity but also by the fact that these objects appear differently in images collected from different sensors (optical and radar) and platforms (satellite, aerial, and unmanned aerial vehicles (UAV)). Although there exists a plethora of object detection methods in the area of remote sensing, given the very fast development of prevalent deep learning methods, there is still a lack of recent updates for object detection methods. In this paper, we aim to provide an update that informs researchers about the recent development of object detection methods and their close sibling in the deep learning era, instance segmentation. The integration of these methods will cover approaches to data at different scales and modalities, such as optical, synthetic aperture radar (SAR) images, and digital surface models (DSM). Specific emphasis will be placed on approaches addressing data and label limitations in this deep learning era. Further, we survey examples of remote sensing applications that benefited from automatic object detection and discuss future trends of the automatic object detection in EO.
environmental sciences; imaging science & photographic technology; remote sensing; geosciences, multidisciplinary
What problem does this paper attempt to address?
The paper addresses object detection in remote sensing images, focusing on deep learning-based methods and their applications in remote sensing. Specifically:

1. **Object Detection and Segmentation**: The paper explores how deep learning techniques are used for object detection (identifying the location and category of objects of interest in an image) and instance segmentation (extracting the boundaries of each detected object). It also discusses panoptic segmentation, which additionally segments background categories.
2. **Application of Multimodal Data**: Because remote sensing data are diverse and complex, spanning sources such as optical images and synthetic aperture radar (SAR) images, the paper emphasizes the importance of multimodal data. Combining data from different types of sensors can improve the discriminative ability of features, especially when labeled data are limited.
3. **Few-Shot Learning and Language Models**: The paper explores how existing large collections of natural images and remote sensing images can be used for few-shot learning (X-shot learning), and how pre-trained networks can enhance object detection performance. It also mentions the potential of language models for automated fine-grained object detection.
4. **Overview of Remote Sensing Sensors**: The paper introduces the characteristics of different remote sensing sensors and data sources, including optical sensors, SAR sensors, LiDAR, and photogrammetry data. These differ in properties such as resolution and spectral coverage, which makes them suitable for different application scenarios.

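
To make the distinction between the two task outputs concrete, the following is a minimal sketch and is not taken from the paper. It assumes a toy instance mask in which 0 marks background and each positive integer marks the pixels of one object, and it derives the per-object bounding box that an object detector would report. The function name `boxes_from_instance_mask`, the toy mask, and the category labels are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Assumed toy encoding: 0 = background, k > 0 = pixels belonging to instance k.
Mask = List[List[int]]
# Box as (row_min, col_min, row_max, col_max), all inclusive.
Box = Tuple[int, int, int, int]


def boxes_from_instance_mask(mask: Mask) -> Dict[int, Box]:
    """Derive one axis-aligned bounding box per instance id in the mask."""
    boxes: Dict[int, Box] = {}
    for r, row in enumerate(mask):
        for c, instance_id in enumerate(row):
            if instance_id == 0:
                continue  # background pixels belong to no object
            if instance_id not in boxes:
                boxes[instance_id] = (r, c, r, c)
            else:
                r0, c0, r1, c1 = boxes[instance_id]
                # Grow the box so that it also covers the current pixel.
                boxes[instance_id] = (min(r0, r), min(c0, c), max(r1, r), max(c1, c))
    return boxes


# Hypothetical 3x5 scene with two objects (illustrative values only).
toy_mask = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 2],
    [0, 0, 0, 2, 2],
]
toy_labels = {1: "building", 2: "vehicle"}

# Detection-style output: a category plus a box, with no per-pixel boundary.
detections = [
    {"category": toy_labels[instance_id], "box": box}
    for instance_id, box in sorted(boxes_from_instance_mask(toy_mask).items())
]
print(detections)
```

The mask keeps the exact pixel footprint of each object, which is what instance segmentation provides, while the derived boxes keep only location and extent. Going from mask to box is straightforward; recovering a mask from a box is not, which is one way to see why segmentation is the more demanding of the two tasks.
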
In summary, the paper aims to update and systematically synthesize current deep learning-based object detection methods, with a particular focus on the application of multimodal data processing and language models to address the challenges of object detection in remote sensing data.