MultiMedia Modeling: 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II

Wen-Huang Cheng,Junmo Kim,Wei-Ta Chu,Peng Cui,Jung-Woo Choi,Min-Chun Hu,Wesley De Neve
DOI: https://doi.org/10.1007/978-3-030-37731-1
2020-01-01
Abstract:Few-shot learning, which learns from a small number of samples, is an emerging field in multimedia. Through systematically exploring influences of scale information, including multi-scale feature extraction, multi-scale comparison and increased parameters brought by multiple scales, in this paper, we present a novel end-to-end model called Multi-scale Comparison Network (MSCN) for few-shot learning. The proposed MSCN uses different scale convolutions for comparison to solve the problem of excessive gaps between target sizes in the images during fewshot learning. It first uses a 4-layer encoder to encode support and testing samples to obtain their feature maps. After deep splicing these feature maps, the proposed MSCN further uses a comparator comprising two layers of multi-scale comparative modules and two fully connected layers to derive the similarity between support and testing samples. Experimental results on two benchmark datasets including Omniglot and miniImagenet shows the effectiveness of the proposed MSCN, which has averagely 2% improvement on miniImagenet in all experimental results compared with the recent Relation Network.
What problem does this paper attempt to address?