Abstract:How can we efficiently compress convolutional neural network (CNN) using depthwise separable convolution, while retaining their accuracy on classification tasks? Depthwise separable convolution, which replaces a standard convolution with a depthwise convolution and a pointwise convolution, has been used for building lightweight architectures. However, previous works based on depthwise separable convolution are limited when compressing a trained CNN model since (1) they are mostly heuristic approaches without a precise understanding of their relations to standard convolution, and (2) their accuracies do not match that of the standard convolution. In this paper, we propose Falcon , an accurate and lightweight method to compress CNN based on depthwise separable convolution. Falcon uses generalized elementwise product (GEP), our proposed mathematical formulation to approximate the standard convolution kernel, to interpret existing convolution methods based on depthwise separable convolution. By exploiting the knowledge of a trained standard model and carefully determining the order of depthwise separable convolution via GEP, Falcon achieves sufficient accuracy close to that of the trained standard model. Furthermore, this interpretation leads to developing a generalized version rank- k Falcon which performs k independent Falcon operations and sums up the result. Experiments show that Falcon (1) provides higher accuracy than existing methods based on depthwise separable convolution and tensor decomposition and (2) reduces the number of parameters and FLOPs of standard convolution by up to a factor of 8 while ensuring similar accuracy. We also demonstrate that rank- k Falcon further improves the accuracy while sacrificing a bit of compression and computation reduction rates.

Falcon: lightweight and accurate convolution based on depthwise separable convolution

FALCON: Lightweight and Accurate Convolution

FalconNet: Factorization for the Light-weight ConvNets.

Real-time Semantic Segmentation with Weighted Factorized-Depthwise Convolution

A Convolutional Neural Network Accelerator Architecture with Fine-Granular Mixed Precision Configurability.

Falcon: Accelerating Homomorphically Encrypted Convolutions for Efficient Private Mobile Network Inference

XSepConv: Extremely Separated Convolution for Efficient Deep Networks with Large Kernels

Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks

Depthwise Multiception Convolution for Reducing Network Parameters without Sacrificing Accuracy

XSepConv: Extremely Separated Convolution

Network Decoupling: From Regular to Depthwise Separable Convolutions

FALCON: A Fourier Transform Based Approach for Fast and Secure Convolutional Neural Network Predictions

A Lightweight Face Detector by Integrating the Convolutional Neural Network with the Image Pyramid

FSNet: Compression of Deep Convolutional Neural Networks by Filter Summary

Designing Lightweight Feature Descriptor Networks with Depthwise Separable Convolution

Lightweight monocular depth estimation using a fusion-improved transformer

Comb Convolution for Efficient Convolutional Architecture

Optimized Separable Convolution: Yet Another Efficient Convolution Operator.

FPGA based Flexible Implementation of Light Weight Inference on Deep Convolutional Neural Networks

Design Light-weight 3D Convolutional Networks for Video Recognition Temporal Residual, Fully Separable Block, and Fast Algorithm

A Lightweight Block with Information Flow Enhancement for Convolutional Neural Networks