SimDet: Cross Similarity Attention for One-shot Object Detection

Rujia Cai,Yingjie Qin,Lizhe Qi,Yunquan Sun
DOI: https://doi.org/10.1109/ijcnn52387.2021.9533941
2021-01-01
Abstract:Object detection based on the convolutional neural network requires a large of datasets for training to achieve good results. However, it is labor-intensive or unrealistic to prepare such high-quality training data in most industrial applications. Recently, one-shot object detection task was proposed aiming to tackle this challenging problem by using only one sample for reference. In this work, a new framework named SimDet based on Faster R-CNN has been proposed for one-shot object detection. Specifically, query and target image features are extracted through a Siamese network, and target features are enhanced in where has high similarity with query feature. In order to solve the problem of RPN learning difficulty with one sample, we propose a new module of Cross Similarity Module, which focuses more on the difference between query and support rather than the feature itself. Furthermore, we design a similarity loss for learning the cross similarity between target feature and query feature. Finally, the extensive experiments show that our model achieves state-of-the-art performance on PASCAL VOC under one-shot setting of detecting objects from both seen and novel classes and on MS COCO from seen classes.
What problem does this paper attempt to address?