DDAug: Differentiable Data Augmentation for Weakly Supervised Semantic Segmentation
Boyang Li,Fei Zhang,Longguang Wang,Yingqian Wang,Ting Liu,Zaiping Lin,Wei An,Yulan Guo
DOI: https://doi.org/10.1109/tmm.2023.3326300
IF: 7.3
2023-01-01
IEEE Transactions on Multimedia
Abstract:Weakly supervised semantic segmentation (WSSS) with image-level labels has witnessed promising advances with the help of class activation maps (CAM). However, CAM is always confined to small discriminative seed regions due to its simple classification loss guided training manner. To handle this problem, recent works introduced specifically designed regularizations and modules to expand the CAM seed regions, serving as the final segmentation masks. In this paper, we surprisingly find that the classification loss could suppress the gains from these regularization and modules in the late training phase, thereby limiting the further growth of CAM, which we call as the explicit supervision disturb (ESD) issue. Interestingly, we find that specific data augmentation (DA) operations (e.g., CutMix) can relieve such ESD issue, and the benefits introduced by different DA operations vary a lot. To maximize the benefits, we propose differentiable data augmentation (DDAug) to automatically search for the proper DA policy. Specifically, we design a multi-level search space to sequentially sample DA operations with different properties. Extensive experiments demonstrate that the proposed DDAug can alleviate the ESD issue and introduce consistent improvements to various popular WSSS methods, achieving the state-of-the-art performance on the MS COCO 2014 and PASCAL VOC 2012 datasets.
computer science, information systems,telecommunications, software engineering