Abstract: Transfer learning enables convolutional neural networks (CNNs) to acquire knowledge from a source domain and transfer it to a target domain where collecting large-scale annotated examples is time-consuming and expensive. Conventionally, when knowledge learned on one task is transferred to another, the deeper layers of a pre-trained CNN are fine-tuned on the target dataset. However, these layers were designed for the source task and may be over-parameterized for the target task. Fine-tuning them on the target dataset can therefore hurt the CNN's generalization ability because of the high network complexity. To tackle this problem, we propose a two-stage framework called TASCNet that enables efficient knowledge transfer. In the first stage, the configuration of the deeper layers is learned automatically and the layers are fine-tuned on the target dataset. In the second stage, redundant filters are pruned from the fine-tuned CNN to decrease the network's complexity for the target task while preserving performance. This two-stage mechanism finds a compact version of the pre-trained CNN with an optimal structure (number of filters in a convolutional layer, number of neurons in a dense layer, and so on) from the hypothesis space. The efficacy of the proposed method is evaluated using VGG-16, ResNet-50, and DenseNet-121 on the CalTech-101, CalTech-256, and Stanford Dogs datasets. Beyond computer vision, we also conduct experiments on the Movie Review sentiment analysis task. The proposed TASCNet reduces the computational complexity of pre-trained CNNs on the target task by reducing both trainable parameters and FLOPs, which enables resource-efficient knowledge transfer. The source code is available at: https://github.com/Debapriya-Tula/TASCNet.
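To make the second stage concrete, the following minimal sketch shows one common way redundant filters can be removed from a convolutional layer and how that lowers the trainable-parameter count. It is an illustration under stated assumptions, not the authors' implementation: ranking filters by L1 norm is a stand-in criterion chosen for simplicity, and the names `l1_norm`, `prune_filters`, `conv_params`, and the toy weights are all hypothetical.

```python
# Illustrative sketch only: L1-norm filter ranking is an assumed stand-in,
# not necessarily the pruning criterion used by TASCNet.

def l1_norm(filt):
    """Sum of absolute weights of one filter (nested lists: in_ch x k x k)."""
    return sum(abs(w) for channel in filt for row in channel for w in row)

def prune_filters(filters, keep_ratio):
    """Keep the highest-L1-norm filters, preserving their original order."""
    n_keep = max(1, round(len(filters) * keep_ratio))
    ranked = sorted(range(len(filters)),
                    key=lambda i: l1_norm(filters[i]), reverse=True)
    kept = sorted(ranked[:n_keep])
    return kept, [filters[i] for i in kept]

def conv_params(n_filters, in_channels, k):
    """Trainable parameters of a conv layer with one bias per filter."""
    return n_filters * (in_channels * k * k + 1)

# Toy layer: 4 filters, 1 input channel, 2x2 kernels (hypothetical values).
layer = [
    [[[0.9, -0.8], [0.7, 0.6]]],
    [[[0.01, 0.0], [-0.02, 0.01]]],
    [[[0.5, 0.4], [-0.3, 0.2]]],
    [[[0.0, 0.03], [0.01, -0.01]]],
]
kept_idx, pruned_layer = prune_filters(layer, keep_ratio=0.5)
params_before = conv_params(len(layer), 1, 2)
params_after = conv_params(len(pruned_layer), 1, 2)
print(kept_idx, params_before, params_after)
```

In this toy layer the two near-zero filters are dropped, which halves the layer's parameters. In a real network the input channels of the following layer would also have to be pruned to match, and the pruned model would typically be fine-tuned again to recover accuracy.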

Net2net: Accelerating learning via knowledge transfer

Progressive Network Grafting for Few-Shot Knowledge Distillation

Transferring Core Knowledge via Learngenes

Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model

Robust and Efficient Transfer Learning via Supernet Transfer in Warm-started Neural Architecture Search

Energy-efficient and Robust Cumulative Training with Net2Net Transformation

Target aware network architecture search and compression for efficient knowledge transfer

Attention Bridging Network For Knowledge Transfer

Unleashing the potential of GNNs via Bi-directional Knowledge Transfer

Adaptive knowledge transfer for class incremental learning

Gated Transfer Network for Transfer Learning

MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities

Online Learning to Accelerate Neural Network Inference with Traveling Classifiers

Augmenting Knowledge Transfer across Graphs

NES-TL: Network Embedding Similarity-Based Transfer Learning

Network Grafting: Transferring Learned Features from Trained Neural Networks

Towards Understanding the Transferability of Deep Representations

Robust Knowledge Transfer Via Hybrid Forward on the Teacher-Student Model

Knowledge Projection for Deep Neural Networks

Real-Time Decentralized knowledge Transfer at the Edge

Online Knowledge Distillation via Collaborative Learning