Diverse Capsules Network Combining Multiconvolutional Layers for Remote Sensing Image Scene Classification

Asif Raza,Hong Huo,Salayidin Sirajuddin,Tao Fang
DOI: https://doi.org/10.1109/JSTARS.2020.3021045
IF: 4.715
2020-01-01
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Abstract:Remote sensing image scene classification has drawn significant attention for its potential applications in the economy and livelihoods. Unlike the traditional handcrafted features, the convolutional neural networks provide an excellent avenue in obtaining powerful discriminative features. Although tremendous efforts have been made so far in this domain, there are still many open challenges in scene classification due to the scene complexity with higher within-class diversity and between-class similarity. To solve the above-mentioned problems, DcapsulesNet (D-CapsNet) is proposed to learn the richer and more robust features for scene classification. It is an end to end network with four types of layers and incorporates visual attention mechanisms. Its diverse capsules encode different properties of complex image scenes, including deep high-level features, spatial attention based on the fusion of multilayers features, both spatial and channel attention based on high-level features, and their fusion. Experiments on three image scene datasets demonstrate that D-CapsNet outperforms other baselines and state-of-the-art methods with a significant improvement in both classification accuracy and speed.
What problem does this paper attempt to address?