Deep Learning Meets OBIA: Tasks, Challenges, Strategies, and Perspectives

Lei Ma,Ziyun Yan,Mengmeng Li,Tao Liu,Liqin Tan,Xuan Wang,Weiqiang He,Ruikun Wang,Guangjun He,Heng Lu,Thomas Blaschke
2024-08-03
Abstract:Deep learning has gained significant attention in remote sensing, especially in pixel- or patch-level applications. Despite initial attempts to integrate deep learning into object-based image analysis (OBIA), its full potential remains largely unexplored. In this article, as OBIA usage becomes more widespread, we conducted a comprehensive review and expansion of its task subdomains, with or without the integration of deep learning. Furthermore, we have identified and summarized five prevailing strategies to address the challenge of deep learning's limitations in directly processing unstructured object data within OBIA, and this review also recommends some important future research directions. Our goal with these endeavors is to inspire more exploration in this fascinating yet overlooked area and facilitate the integration of deep learning into OBIA processing workflows.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the challenges faced when combining deep learning (DL) technology with object - based image analysis (OBIA). Specifically, the paper aims to: 1. **Summarize and expand the task sub - domains of OBIA**: With the wide application of OBIA, the paper comprehensively summarizes and expands the tasks of OBIA, including but not limited to segmentation, classification, feature extraction, change detection, time - series analysis, parameter inversion, and object detection. 2. **Address the limitations of deep learning in OBIA**: Deep - learning methods such as convolutional neural networks (CNNs) are difficult to directly handle unstructured object data with irregular characteristics, which limits their application in OBIA. Therefore, the paper identifies and summarizes five main strategies to overcome these challenges. 3. **Propose future research directions**: In order to promote the further development of deep learning in the OBIA field, the paper also proposes several important future research directions, hoping to inspire more exploration of this neglected but full - of - potential research area. ### Specific problem description of the paper - **Insufficient integration of deep learning and OBIA**: Although deep learning has made remarkable progress in the remote sensing field, especially in pixel - level or scene - level applications, the application of deep learning in object - based image analysis is still in its infancy. - **Difficulty in handling unstructured object data**: Deep - learning models usually require regular inputs (such as fixed - size image patches), while the objects in OBIA are often irregular, which makes it difficult to directly apply deep learning. - **Lack of existing literature reviews**: At present, there is no comprehensive literature review summarizing the combination of deep learning and OBIA, which seriously hinders the application and development of deep learning in the OBIA field. ### Goals of the paper By systematically reviewing and expanding the task sub - domains of OBIA and summarizing the existing challenges and solutions, the paper aims to: - **Promote the application of deep learning in OBIA**: Provide a comprehensive framework to help researchers better understand and apply deep - learning technology to OBIA tasks. - **Drive further research in academia and industry**: Provide directions and guidance for future research to solve the current technical and methodological bottlenecks. ### Formula example Some key technical points involved in the paper can be more accurately described by formulas. For example, the Sobel operator commonly used in edge detection can be represented as: \[ G_x=\begin{bmatrix} - 1&0&1\\ - 2&0&2\\ - 1&0&1 \end{bmatrix},\quad G_y = \begin{bmatrix} - 1&- 2&- 1\\ 0&0&0\\ 1&2&1 \end{bmatrix} \] where \(G_x\) and \(G_y\) are the horizontal and vertical gradient operators respectively, which are used to calculate the gradient information of the image. In this way, the paper not only solves the technical problems of the combination of deep learning and OBIA, but also provides a theoretical basis and practical guide for future research.