Abstract:Deep learning has revolutionized computing in many real-world applications, arguably due to its remarkable performance and extreme convenience as an end-to-end solution. However, deep learning models can be costly to train and to use, especially for those large-scale models, making it necessary to optimize the original overly complicated models into smaller ones in scenarios with limited resources such as mobile applications or simply for resource saving. The key question in such model optimization is, how can we effectively identify and measure the redundancy in a deep learning model structure. While several common metrics exist in the popular model optimization techniques to measure the performance of models after optimization, they are not able to quantitatively inform the degree of remaining redundancy. To address the problem, we present a novel testing approach, i.e., RedTest, which proposes a novel testing metric called Model Structural Redundancy Score (MSRS) to quantitatively measure the degree of redundancy in a deep learning model structure. We first show that MSRS is effective in both revealing and assessing the redundancy issues in many state-of-the-art models, which urgently calls for model optimization. Then, we utilize MSRS to assist deep learning model developers in two practical application scenarios: 1) in Neural Architecture Search, we design a novel redundancy-aware algorithm to guide the search for the optimal model structure and demonstrate its effectiveness by comparing it to existing standard NAS practice; 2) in the pruning of large-scale pre-trained models, we prune the redundant layers of pre-trained models with the guidance of layer similarity to derive less redundant ones of much smaller size. Extensive experimental results demonstrate that removing such redundancy has a negligible effect on the model utility.

Reducing Overfitting in Deep Convolutional Neural Networks Using Redundancy Regularizer

DCCD: Reducing Neural Network Redundancy Via Distillation

Reducing Overfitting in Deep Networks by Decorrelating Representations

Structured Pruning for Efficient Convolutional Neural Networks Via Incremental Regularization

Wordreg: Mitigating the Gap Between Training and Inference with Worst-Case Drop Regularization

Structured Deep Neural Network Pruning by Varying Regularization Parameters.

SparseConnect: Regularising CNNs on Fully Connected Layers

Reliable Identification of Redundant Kernels for Convolutional Neural Network Compression.

Overfitting Remedy by Sparsifying Regularization on Fully-Connected Layers of CNNs.

KRR-CNN: kernels redundancy reduction in convolutional neural networks

Dropout Reduces Underfitting

Convolutional Neural Networks With Dynamic Regularization

Effective and Efficient Dropout for Deep Convolutional Neural Networks

Kernel Orthogonality does not necessarily imply a Decrease in Feature Map Redundancy in CNNs: Convolutional Similarity Minimization

SlimConv: Reducing Channel Redundancy in Convolutional Neural Networks by Features Recombining

ResNet Structure Simplification with the Convolutional Kernel Redundancy Measure

Improving Model Robustness Against Adversarial Examples with Redundant Fully Connected Layer.

Regularizing Deep Convolutional Neural Networks with a Structured Decorrelation Constraint.

LocalDrop: A Hybrid Regularization for Deep Neural Networks

A New Measure of Model Redundancy for Compressed Convolutional Neural Networks

RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively