Scene classification with improved AlexNet model

Lisha Xiao,Qin Yan,Shuyu Deng
DOI: https://doi.org/10.1109/iske.2017.8258820
2017-11-01
Abstract:Scene classification is an important research branch of image comprehension, which gains information from images and interprets them using computer system by imitating the biological systems of human beings. AlexNet model is limited in image classification because of the large convolution kernel and stride in the first convolutional layer leading to over rapid decline of feature maps resolution and excessive compression of spatial information. This paper proposed an improved AlexNet model according to the design principle of convolutional neural networks (CNNs). The large convolution kernel is decomposed into a structure cascaded by two small convolution kernels with reduced stride. Another convolutional layer is added after the first one to enhance the integration process of the low-level features or the spatial information. The asymmetric convolution kernel is applied in the last three convolutional layers. The experiments on two datasets show that the classification accuracy of the improved AlexNet model is higher than those of AlexNet model and ZFNet model for 23 categories of scene classification.
What problem does this paper attempt to address?