Generic Convolutional Neural Network with Random Pooling Area

Zhidong Deng,Zhenyang Wang,Shiyao Wang
2016-01-01
Abstract:As an automated hierarchical feature extractor, deep convolutional neural network (CNN) is increasingly in the spotlight. In order to further improve feature representation capabilities of CNN, this paper proposes a novel SAPNet model that incorporates a stochastic area pooling (SAP) method with a generic stacked T-shaped CNN architecture. In our SAP method, pooling area is randomly transformed and max pooling operation is then conducted on such areas, which means that regular pooling area of fixed upright squares are no longer exploited in the training phase of our SAPNet. In a sense, it could be viewed as the use of feature-level augmentation. Meanwhile, we present a generic CNN architecture that structurally resembles three stacked T-shaped cubes. In such architecture, the number of kernels in convolutional layer preceding any pooling layer is doubled and all learnable weight layers are combined with batch normalization and dropout with a small ratio. Interestingly, all SAPNets have the same structures and similar parameter settings on different benchmarks. Finally, on CIFAR-10, CIFAR-100, MNIST, and SVHN datasets, the experimental results show that our SAPNet requires fewer parameters than regular CNN models but nevertheless achieves superior recognition performances for all the four benchmarks.
What problem does this paper attempt to address?