Which Target to Focus On: Class-Perception for Semantic Segmentation of Remote Sensing.

Long Sun,Lingling Li,Yilin Shao,Licheng Jiao,Xu Liu,Puhua Chen,Fang Liu,Shuyuan Yang,Biao Hou
DOI: https://doi.org/10.1109/tgrs.2023.3278133
IF: 8.2
2023-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Deep-learning-based (DL) methods have dominated the task of semantic segmentation of remote sensing images. However, the sizes of different objects vary widely, and there is a great deal of label noise due to the inevitable shadows. Therefore, there is an urgent need for a method that can precisely handle complex ground data. In this article, we propose an interclass enhanced network (ICEN) for representing features of varying sizes. It comprises two branches: sparse representation network (SPN) and feature extraction network (FEN). Then, a class-perception block (CPB) is inserted between the two branches to instruct the SPN’s low-level semantic features to be merged into the deeper network. Such a block can reduce label noise in remote sensing image segmentation. In addition, the proposed EIRI provides a more precise classification process for target edges containing many misclassified points without requiring excessive computational overhead. The experimental results of our proposed class-perception network (C-PNet) achieve competitive performance on the Vaihingen, Potsdam, LoveDA, and UAVid datasets.
What problem does this paper attempt to address?