Agree to Disagree: Exploring Partial Semantic Consistency against Visual Deviation for Compositional Zero-Shot Learning

Xiangyu Li, Xu Yang, Xi Wang, Cheng Deng
DOI: https://doi.org/10.1109/tcds.2024.3367957
IF: 4.546
2024-01-01
IEEE Transactions on Cognitive and Developmental Systems
Abstract: Compositional Zero-Shot Learning (CZSL) aims to recognize novel concepts from known sub-concepts. However, it remains challenging because the intricate interaction between sub-concepts is entangled with their corresponding visual features, which degrades the recognition accuracy of concepts. In addition, the domain gap between training and testing data leads to poor model generalization. In this paper, we tackle these problems by exploring Partial Semantic Consistency to eliminate Visual Deviation, so as to guarantee the discrimination and generalization of representations. Considering the complicated interaction between sub-concepts and their visual features, we decompose seen images into visual elements according to their labels and obtain the instance-level sub-deviations from compositions, which are used to excavate the category-level primitives of sub-concepts. Furthermore, we present a multi-scale concept composition approach to produce virtual samples from two aspects, which improves the sufficiency and diversity of samples so that the proposed model can generalize to novel compositions. Extensive experiments indicate that our method significantly outperforms the state-of-the-art approaches on three benchmark datasets.
robotics, computer science, artificial intelligence, neurosciences