Remote Sensing Scene Classification Using Multilayer Stacked Covariance Pooling

Nanjun He,Leyuan Fang,Shutao Li,Antonio Plaza,Javier Plaza
DOI: https://doi.org/10.1109/tgrs.2018.2845668
IF: 8.2
2018-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:This paper proposes a new method, called multilayer stacked covariance pooling (MSCP), for remote sensing scene classification. The innovative contribution of the proposed method is that it is able to naturally combine multilayer feature maps, obtained by pretrained convolutional neural network (CNN) models. Specifically, the proposed MSCP-based classification framework consists of the following three steps. First, a pretrained CNN model is used to extract multilayer feature maps. Then, the feature maps are stacked together, and a covariance matrix is calculated for the stacked features. Each entry of the resulting covariance matrix stands for the covariance of two different feature maps, which provides a natural and innovative way to exploit the complementary information provided by feature maps coming from different layers. Finally, the extracted covariance matrices are used as features for classification by a support vector machine. The experimental results, conducted on three challenging data sets, demonstrate that the proposed MSCP method can not only consistently outperform the corresponding single-layer model but also achieve better classification performance than other pretrained CNN-based scene classification methods.
What problem does this paper attempt to address?