Neural Fine-Tuning Search for Few-Shot Learning

Panagiotis Eustratiadis,Łukasz Dudziak,Da Li,Timothy Hospedales
2023-06-16
Abstract:In few-shot recognition, a classifier that has been trained on one set of classes is required to rapidly adapt and generalize to a disjoint, novel set of classes. To that end, recent studies have shown the efficacy of fine-tuning with carefully crafted adaptation architectures. However this raises the question of: How can one design the optimal adaptation strategy? In this paper, we study this question through the lens of neural architecture search (NAS). Given a pre-trained neural network, our algorithm discovers the optimal arrangement of adapters, which layers to keep frozen and which to fine-tune. We demonstrate the generality of our NAS method by applying it to both residual networks and vision transformers and report state-of-the-art performance on Meta-Dataset and Meta-Album.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of how to design the optimal adaptation strategy in Few-Shot Learning (FSL) to enable effective rapid adaptation across multiple visual domains. Specifically, the paper focuses on how to find the optimal adapter configuration through Neural Architecture Search (NAS) based on a given pre-trained neural network. This includes determining which layers need to remain frozen, which layers need fine-tuning, and where to insert adapter modules to achieve effective adaptation to new categories and domains. The paper proposes a new method called Neural Fine-Tuning Search (NFTS), which aims to optimize the balance between adaptability and generalization in few-shot learning by systematically exploring these configurations. NFTS constructs a supernet containing all possible configurations and trains this supernet by randomly sampling paths during training. Subsequently, during testing, NFTS selects the model that best fits the current support set data from a pre-selected set of diverse models for adaptation. This approach not only improves computational efficiency but also reduces the risk of overfitting by introducing diversity constraints during training, especially when training data is limited. The paper validates the effectiveness of NFTS through experiments on two benchmark datasets, Meta-Dataset and Meta-Album, demonstrating its superior performance in multi-domain few-shot learning tasks.