Abstract:After decades of research, there is still no solution for indoor localization like the GNSS (Global Navigation Satellite System) solution for outdoor environments. The major reasons for this phenomenon are the complex spatial topology and RF transmission environment. To deal with these problems, an indoor scene constrained method for localization is proposed in this paper, which is inspired by the visual cognition ability of the human brain and the progress in the computer vision field regarding high-level image understanding. Furthermore, a multi-sensor fusion method is implemented on a commercial smartphone including cameras, WiFi and inertial sensors. Compared to former research, the camera on a smartphone is used to "see" which scene the user is in. With this information, a particle filter algorithm constrained by scene information is adopted to determine the final location. For indoor scene recognition, we take advantage of deep learning that has been proven to be highly effective in the computer vision community. For particle filter, both WiFi and magnetic field signals are used to update the weights of particles. Similar to other fingerprinting localization methods, there are two stages in the proposed system, offline training and online localization. In the offline stage, an indoor scene model is trained by Caffe (one of the most popular open source frameworks for deep learning) and a fingerprint database is constructed by user trajectories in different scenes. To reduce the volume requirement of training data for deep learning, a fine-tuned method is adopted for model training. In the online stage, a camera in a smartphone is used to recognize the initial scene. Then a particle filter algorithm is used to fuse the sensor data and determine the final location. To prove the effectiveness of the proposed method, an Android client and a web server are implemented. The Android client is used to collect data and locate a user. The web server is developed for indoor scene model training and communication with an Android client. To evaluate the performance, comparison experiments are conducted and the results demonstrate that a positioning accuracy of 1.32 m at 95% is achievable with the proposed solution. Both positioning accuracy and robustness are enhanced compared to approaches without scene constraint including commercial products such as IndoorAtlas.

InstaIndoor and multi-modal deep learning for indoor scene recognition

Indoor Scene Recognition: An Attention-Based Approach Using Feature Selection-Based Transfer Learning and Deep Liquid State Machine

Look and Listen: A Multi-modality Late Fusion Approach to Scene Classification for Autonomous Machines

Indoor Scene Recognition via Object Detection and TF-IDF

Indoor scene recognition through object detection

Deep Learning Based Application for Indoor Scene Recognition

Indoor Scene Recognition in 3D

Indoor Space Recognition using Deep Convolutional Neural Network: A Case Study at MIT Campus

Indoor Scene Recognition Mechanism Based on Direction-Driven Convolutional Neural Networks

Recognition of Indoor Scenes Using 3-D Scene Graphs

An event-based approach to multi-modal activity modeling and recognition

Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification

What can i do around here? Deep functional scene understanding for cognitive robots

MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments

Object-to-Scene: Learning to Transfer Object Knowledge to Indoor Scene Recognition

Indoor scene recognition by a mobile robot through adaptive object detection

Scene Recognition for Indoor Localization Using a Multi-Sensor Fusion Approach

A deep learning-based global and segmentation-based semantic feature fusion approach for indoor scene classification

Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents

What's in my Room? Object Recognition on Indoor Panoramic Images