Zero-shot learning via a specific rank-controlled semantic autoencoder

Yang Liu,Xinbo Gao,Jungong Han,Li Liu,Ling Shao
DOI: https://doi.org/10.1016/j.patcog.2021.108237
IF: 8
2022-02-01
Pattern Recognition
Abstract:Existing embedding zero-shot learning models usually learn a projection function from the visual feature space to the semantic embedding space, e.g. attribute space or word vector space. However, the projection learned based on seen samples may not generalize well to unseen classes, which is known as the projection domain shift problem in ZSL. To address this issue, we propose a method named Low-rank Semantic Autoencoder (LSA) to consider the low-rank structure of seen samples maintain the sparse feature of reconstruction error, which can further improve zero-shot learning capability. Moreover, in order to obtain a more robust projection for unseen classes, we additionally propose a Specific Rank-controlled Semantic Autoencoder (SRSA) to achieve an accurate control of projection's rank. Extensive experiments on six benchmarks demonstrate the superiority of the proposed models over most existing embedding ZSL models under the standard zero-shot setting and the more realistic generalized zero-shot setting.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?