Self-Attentive Networks for One-Shot Image Recognition

Pin Fang,Yisen Wang,Yuan Luo
DOI: https://doi.org/10.1109/icme.2019.00165
2019-01-01
Abstract:Despite recent breakthroughs in applications of deep neural networks, learning from a limited number of examples remains a key challenge in academia and industry. A prototypical example is one-shot learning, in which we must make predictions when only given one sample of each unseen class. Most previous works either categorize instances by the pair similarities or use Long-Short Term Memory (LSTM) structure based method, however, they usually suffer from inter-and intra-class variation problem or high computation cost. In this paper, we propose self-attentive networks to address the above issues in one-shot image recognition problem. Deep neural features and attention mechanism especially self-attention are employed in the proposed self-attentive networks. Experiments on Omniglot and MiniImagenet datasets show that our proposed self-attentive networks achieve state-of-the-art accuracy on one-shot image recognition. In addition, owing to low time complexity and parallelizable architecture, self-attentive networks require significantly less time to train and infer.
What problem does this paper attempt to address?