Abstract:Visual scene recognition is an indispensable part of automatic localization and navigation. In the same scene, the appearance and viewpoint may be changed greatly, which is the largest challenge for some advanced unmanned systems,e.g. robot,vehicle and UAV,etc., to identify scenes where they have visited. Traditional methods have been subjected to hand-made feature-based paradigms for a long time, mainly relying on the prior knowledge of the designer, and are not sufficiently robust to extreme changing scenes. In this paper, we cope with scene recognition with automatically learning the representation of features from big image samples. Firstly, we propose a novel approach for scene recognition via training a slight-weight convolutional neural network (CNN) that overall has less complex and more efficient network architecture, and is trainable in the manner of end-to-end. The proposed approach uses the deep-leaning features of self-selection combining with light CNN process to perform high semantic understanding of visual scenes. Secondly, we propose to employ a salient region-based technology to extract the local feature representation of a specific scene region directly from the convolution layer based on self-selection mechanism, and each layer performs a linear operation with end-to-end manner. Furthermore, we also utilize probability statistics to calculate the total similarity of several regions in one scene to other regions, and finally rank the similarity scores to select the correct scene. We have conducted a lot of experiments to evaluate the results of performance by comparing four methods (namely, our proposed and other three well known and advanced methods). Experimental results show that the proposed method is more robust and accurate than other three well-known methods in extremely harsh environments (e. g. weak light and strong blur).

A Scene Recognition Method Using Sparse Features with Layout-Sensitive Pooling and Extreme Learning Machine

Image super-resolution representation via image patches based on extreme learning machine

Correntropy Extreme Learning Machine Based on Spatial Pyramid Matching and Local Receptive Field

Encoding Learning Network Combined with Feature Similarity Constraints for Human Action Recognition

SSC: Semantic Scan Context for Large-Scale Place Recognition

Enhanced Multi-Scale Feature Adaptive Fusion Sparse Convolutional Network for Large-Scale Scenes Semantic Segmentation

Accurate and Efficient Scene Recognition with Compact BoW and Ensemble ELM

Embedding metric learning into an extreme learning machine for scene recognition

A Navel Facial Expression Recognition Method Based On Extreme Learning Machine

Fast Low Rank Representation Based Spatial Pyramid Matching for Image Classification.

Self-Selection Salient Region-Based Scene Recognition Using Slight-Weight Convolutional Neural Network

Denoising Deep Extreme Learning Machine for Sparse Representation

Scene categorization via supervised subspace modeling and sparse representation

A Single-Stream Adaptive Scene Layout Modeling Method for Scene Recognition

Scene Categorization by Deeply Learning Gaze Behavior in a Semisupervised Context

Double-Keggin-type Anion-templated Synthesis of a 3D Porous Coordination Polymer of Eu(III) Ions and dpdo Ligands

Fast Image Recognition Based on Independent Component Analysis and Extreme Learning Machine

Scene Classification Based on Heterogeneous Features of Multi-Source Data

Ensemble Based Constrained-Optimization Extreme Learning Machine For Landmark Recognition

Building Feature Space of Extreme Learning Machine with Sparse Denoising Stacked-Autoencoder.

Locally Supervised Deep Hybrid Model for Scene Recognition