CSCNET: Class-Specified Cascaded Network for Compositional Zero-Shot Learning

Yanyi Zhang,Qi Jia,Xin Fan,Yu Liu,Ran He
DOI: https://doi.org/10.1109/icassp48485.2024.10446756
2024-01-01
Abstract:Attribute and object (A-O) disentanglement is a fundamental and criticalproblem for Compositional Zero-shot Learning (CZSL), whose aim is to recognizenovel A-O compositions based on foregone knowledge. Existing methods based ondisentangled representation learning lose sight of the contextual dependencybetween the A-O primitive pairs. Inspired by this, we propose a novel A-Odisentangled framework for CZSL, namely Class-specified Cascaded Network(CSCNet). The key insight is to firstly classify one primitive and thenspecifies the predicted class as a priori for guiding another primitiverecognition in a cascaded fashion. To this end, CSCNet constructsAttribute-to-Object and Object-to-Attribute cascaded branches, in addition to acomposition branch modeling the two primitives as a whole. Notably, we devise aparametric classifier (ParamCls) to improve the matching between visual andsemantic embeddings. By improving the A-O disentanglement, our frameworkachieves superior results than previous competitive methods.
What problem does this paper attempt to address?