Boundary-aware dichotomous image segmentation
Haonan Tang,Shuhan Chen,Yang Liu,Shiyu Wang,Zeyu Chen,Xuelong Hu
DOI: https://doi.org/10.1007/s00371-024-03295-5
IF: 2.835
2024-02-26
The Visual Computer
Abstract:Dichotomous Image Segmentation is a category-agnostic task aims to segment highly accurate objects from natural images. In semantic segmentation tasks, the utilization of high-resolution input, due to its richer contextual information, can effectively enhance the precision of segmentation and the accuracy of the boundaries. However, in DIS tasks, owing to the increased complexity and diversity of the targets, it is challenging to generate complete segmentation results. Directly employing high-resolution images as input may result in missed detection due to insufficient receptive field coverage. Furthermore, the additional potential details introduced by high resolution, which may not directly relate to the targets, can negatively impact the accuracy of the model’s boundary predictions. To address the above problems, a dual-branch network structure is adopted, where the high-resolution input branch learns detailed information, and the low-resolution input branch captures global semantic information. Specifically, we use the Feature Pyramid Transfer module to enlarge the receptive field and enhance the semantic consistency between different Resnet blocks. In the decoder, we propose the Boundary-Aware structure to fuse features from different backbones and use boundary information to generate more accurate segmentation results. The experimental results demonstrate that our method achieves leading performance in four of the six evaluation metrics used in experiments on the complete DIS-TE testset. For instance, the F-measure reaches 0.801, which is 10.3% higher than that of the ISNet, the baseline method for the DIS task.
computer science, software engineering