Abstract:With the rapid development of various satellite sensors, automatic and advanced scene classification technique is urgently needed to process a huge amount of satellite image data. Recently, a few of research works start to implant the sparse coding for feature learning in aerial scene classification. However, these previous research works use the single-layer sparse coding in their system and their performances are highly related with multiple low-level features, such as scale-invariant feature transform (SIFT) and saliency. Motivated by the importance of feature learning through multiple layers, we propose a new unsupervised feature learning approach for scene classification on very high resolution satellite imagery. The proposed unsupervised feature learning utilizes multipath sparse coding architecture in order to capture multiple aspects of discriminative structures within complex satellite scene images. In addition, the dense low-level features are extracted from the raw satellite data by using different image patches with varying size at different layers, and this approach is not limited to a particularly designed feature descriptors compared with the other related works. The proposed technique has been evaluated on two challenging high-resolution datasets, including the UC Merced dataset containing 21 different aerial scene categories with a 1 foot resolution and the Singapore dataset containing 5 land-use categories with a 0.5m spatial resolution. Experimental results show that it outperforms the state-of-the-art that uses the single-layer sparse coding. The major contributions of this proposed technique include (1) a new unsupervised feature learning approach to generate feature representation for very high-resolution satellite imagery, (2) the first multipath sparse coding that is used for scene classification in very high-resolution satellite imagery, (3) a simple low-level feature descriptor instead of many particularly designed low-level descriptor, such as SIFT descriptors and saliency, (4) evaluation on two satellite image datasets that come from different sensor sources.

Learning Coarse-To-Fine Sparselets For Efficient Object Detection And Scene Classification

SSF: Sparse Point Cloud Object Detection Based on Self-Adaptive Voxel Encoding and Focal-Sparse Convolution

A Lightweight SE-YOLOv3 Network for Multi-Scale Object Detection in Remote Sensing Imagery.

SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

Learning to detect objects in images via a sparse, part-based representation

L1-norm Latent SVM for Compact Features in Object Detection.

SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-Based 3-D Object Detection

Cascaded Sparse Spatial Bins for Efficient and Effective Generic Object Detection

Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection

Multipath Sparse Coding for Scene Classification in Very High Resolution Satellite Imagery

SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining

Enhanced Sparse Detection for End-to-End Object Detection

SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection

DeNet: Scalable Real-Time Object Detection with Directed Sparse Sampling

Effective and Efficient Midlevel Visual Elements-Oriented Land-Use Classification Using VHR Remote Sensing Images

SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection

Fully Sparse Fusion for 3D Object Detection

Selective Sparse Sampling for Fine-Grained Image Recognition

Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids.

A novel self-taught learning framework using spatial pyramid matching for scene classification