Class‐Relation Reasoning with Knowledge‐Transfer for Few‐Shot Object Detection

Xin Feng,Zhixian Zhang,Junjie Wang,Siping Wang,Xiaoning Jiao
DOI: https://doi.org/10.1002/tee.24037
IF: 0.923
2024-03-06
IEEJ Transactions on Electrical and Electronic Engineering
Abstract:Few‐Shot Object Detection (FSOD) task involves accurately identifying target object classes using only a small set of labeled samples. Most of the current FSOD tasks independently predict class prototype features without considering class relationships and only rely on visual information. To address these challenges, we propose a novel Class‐relational Reasoning Method with Knowledge‐transfer (CRK‐Net), built on the meta‐learning‐based framework. Although data may be scarce, the semantic relationship between classes is invariant, Joint‐feature Fusion Module (JFM) are hence proposed to transfers the semantic information of different categories in the natural language world to integrate with visual information and produce multi‐modality embeddings. Some base classes and novel classes have similar features, so this can be borrowed by modeling the relationship between classes feature. Building upon the observation, we propose a Class‐relational Reasoning Module (CRM) to establish the correlations between categories and enhance prototype representations for each category. After passing through the JFM and CRM modules, a high‐quality class prototype is finally produced for subsequent regression and classification. Extensive experiments on PASCAL VOC demonstrate the effectiveness of our proposed method and provide a new scheme for fusing semantic and visual information. © 2024 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
engineering, electrical & electronic
What problem does this paper attempt to address?