Efficient channel expansion and pyramid depthwise-pointwise-depthwise neural networks
Guoqing Li,Meng Zhang,Yu Zhang,Ruixia Wu,Dongpeng Weng
DOI: https://doi.org/10.1007/s10489-021-03152-1
IF: 5.3
2022-02-15
Applied Intelligence
Abstract:In popular lightweight convolutional neural networks (CNNs), pointwise convolution (PWC) layers for combining information occupy approximately 70% weights and computation, but depthwise convolution (DWC) layers for extracting spatial information only occupy less than 2% weights and computation. The weights and computation for extracting spatial information are not enough in lightweight CNNs. In this paper, we proposed expanding the number of channels and improving the extraction of spatial information by more efficient DWC instead of PWC. Firstly, the results of the proposed PSDNet demonstrate that DWC is more efficient than PWC for channel expansion and it can improve the accuracy of the network. Then, the efficient Depthwise-Pointwise-Depthwise (DPD) block is proposed by using DWC to expand channels. Different from the general bottleneck block, the DPD block consists of one PWC layer and two DWC layers. Four kinds of efficient lightweight DPDNets (DPDNet-G, DPDNet-A, DPDNet-C, DPDNet-D) are proposed by stacking different DPD blocks. To extract multi-scale features and achieve high accuracy, the pyramid DWC layer is used when channel expansion in DPDNet. Compared with common lightweight CNNs, DPDNets use more weights and computation in the DWC layer for extracting spatial information. Four competitive benchmark datasets (CIFAR-10, CIFAR-100, ImageNet, and PASCAL VOC) were used to verify the superiority of DPDNet. Experiments demonstrate that the proposed DPDNet has higher accuracy than MobileNet with a similar number of weights and computations. Furthermore, compared DPDNet with MobileNet, it can be found that improving the ratio of DWC to PWC can improve accuracy, which helps researchers to design better lightweight CNNs.
computer science, artificial intelligence