TransUNet with unified focal loss for class-imbalanced semantic segmentation

Kento Wakamatsu,Satoshi Ono
DOI: https://doi.org/10.1007/s10015-023-00919-2
2023-12-21
Artificial Life and Robotics
Abstract:Class imbalanceness, i.e., the inequality of the number of samples between categories, adversely affects machine learning models, including deep neural networks. In semantic segmentation, extracting a small area of minor categories with respect to the entire image includes the same problem as class imbalanceness. Such difficulties exist in various applications of semantic segmentation, including medical images. This paper proposes a semantic segmentation method that considers global features and appropriately detects small categories. The proposed method adopts TransUNet architecture and Unified Focal Loss (UFL) function; the former allows considering global image features, and the latter mitigates the harmful effects of class imbalanceness. Experimental results with real-world applications showed that the proposed method successfully extracts small regions of minor classes without increasing false positives of other classes.
What problem does this paper attempt to address?