Embedding enhancement with foreground feature alignment and primitive knowledge for few-shot learning

Xiaoqi Zheng,Jia Lu
DOI: https://doi.org/10.1016/j.engappai.2024.108823
IF: 8
2024-06-19
Engineering Applications of Artificial Intelligence
Abstract:Few-Shot Learning (FSL) targets a model to quickly discriminate new categories with limited samples. While most methods struggle to effectively utilize the knowledge learned in the base classes, resulting in models that fail to alleviate the domain gap between training and evaluation. Conversely, humans often enable rapid discrimination by two ways: reinforcing impressions through comparing unknown categories with related objects in memory; Searching for the maximum shared information within the intra-class. Motivated by this, in this paper, we propose a novel Transformer-based Embedding Enhancement Network (TEEN) that adaptively leverages the knowledge learned from the base classes, which we refer to as 'primitive knowledge', to complete and distinguish the embeddings for novel classes. Additionally, to reduce interference from background features, we introduce the Transformer-based Foreground Feature Alignment (TFFA) to enhance the representation of image foreground. Ultimately, TEEN enhances the embeddings through foreground alignment and primitive knowledge completion, achieving improved performance in few-shot classification. Extensive experiments on inductive few-shot tasks demonstrate the effectiveness of our approach, achieving state-of-the-art results in cross-domain few-shot tasks.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary
What problem does this paper attempt to address?