Abstract: Indoor environments are a common setting in everyday life, and detecting and tracking multiple targets in them is a key component of many applications. However, the task remains challenging because of limited space and intrinsic target appearance variation, e.g. full or partial occlusion, large pose deformation, and scale change. We propose a novel framework for detection and tracking in indoor environments and extend it to robot navigation. One of the key components of our approach is a virtual top view created from an RGB-D camera, which we call ground plane projection (GPP). The key advantage of GPP is that intrinsic target appearance variation and extrinsic noise are far less likely to appear in GPP than in a regular side-view image. Moreover, it is very simple to determine free space in GPP without any appearance learning, even from a moving camera. Hence GPP is very different from the top-view image obtained from a ceiling-mounted camera. We perform both object detection and tracking in GPP. Two kinds of GPP images are used: gray GPP, which represents the maximal height of the 3D points projecting to each pixel, and binary GPP, which is obtained by thresholding the gray GPP. For detection, simple connected component labeling is used to detect the footprints of targets in binary GPP. For tracking, a novel Pixel Level Association (PLA) strategy is proposed to link the same target across consecutive frames in gray GPP. It uses optical flow in gray GPP, which to the best of our knowledge has never been done before. We then "back project" the detected and tracked objects from GPP to the original side-view (RGB) images, so that objects are also detected and tracked in the side-view (RGB) images. Our system is able to robustly detect and track multiple moving targets in real time. The detection process does not rely on any target model, which means that no training process is needed.
Moreover, tracking does not require any manual initialization, since all entering objects are robustly detected. We also extend the framework to robot navigation by tracking. As our experimental results demonstrate, our approach achieves near-perfect detection and tracking results. The performance gain over state-of-the-art trackers is most significant in the presence of occlusion and background clutter.
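
The GPP construction and the detection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names (`gray_gpp`, `binary_gpp`, `footprints`), the axis convention (x lateral, y height above the floor, z forward), the cell size, the grid extent, and the height threshold are all assumptions chosen for illustration; the abstract does not specify them. The sketch also assumes the 3D points are already expressed in a floor-aligned frame.

```python
from collections import deque


def gray_gpp(points, cell=0.1, width=8, depth=8):
    """Build a gray GPP: each cell keeps the maximal height of its 3D points.

    points: iterable of (x, y, z) in metres (assumed convention:
    x lateral, y height above the floor, z forward distance).
    """
    grid = [[0.0] * width for _ in range(depth)]
    for x, y, z in points:
        if x < 0 or z < 0:
            continue  # outside the (hypothetical) grid extent
        row, col = int(z / cell), int(x / cell)
        if row < depth and col < width:
            grid[row][col] = max(grid[row][col], y)
    return grid


def binary_gpp(gray, min_height=0.3):
    """Threshold the gray GPP; min_height is an assumed value."""
    return [[1 if h > min_height else 0 for h in row] for row in gray]


def footprints(binary):
    """Label 8-connected components; return one list of (row, col) per target."""
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if not binary[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue, component = deque([(r, c)]), []
            while queue:  # breadth-first flood fill from the seed cell
                cr, cc = queue.popleft()
                component.append((cr, cc))
                for dr in (-1, 0, 1):
                    for dc in (-1, 0, 1):
                        nr, nc = cr + dr, cc + dc
                        if (0 <= nr < rows and 0 <= nc < cols
                                and binary[nr][nc] and not seen[nr][nc]):
                            seen[nr][nc] = True
                            queue.append((nr, nc))
            components.append(component)
    return components


# Toy point cloud: two tall objects plus one low floor-level point.
sample_points = [
    (0.12, 1.7, 0.22), (0.27, 1.2, 0.22),  # first object, two adjacent cells
    (0.66, 0.9, 0.63),                     # second object
    (0.44, 0.05, 0.41),                    # near-floor point, removed by threshold
]
targets = footprints(binary_gpp(gray_gpp(sample_points)))
print(len(targets), "footprints detected")
```

In this toy example the near-floor point falls below the threshold, so its cell counts as free space, which mirrors the abstract's remark that free space can be determined in GPP without appearance learning.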

Combining Monocular Camera and 2D Lidar for Target Tracking Using Deep Convolution Neural Network based Detection and Tracking Algorithm

Accurate and Real-Time 3-D Tracking for the Following Robots by Fusing Vision and Ultrasonar Information

Adaptive Multi-Pedestrian Tracking by Multi-Sensor: Track-to-Track Fusion Using Monocular 3D Detection and MMW Radar

Deep CNN-based Visual Target Tracking System Relying on Monocular Image Sensing

A Multi-object Detection and Tracking Method Based on the Fusion of Lidar and Camera

An Object Localization System Using Monocular Camera and Two-Axis-Controlled Laser Ranging Sensor for Mobile Robot

Moving target tracking of mobile robots with fusion of laser scanner and monocular camera

Dynamic Object Tracking for Self-Driving Cars Using Monocular Camera and LIDAR

An Advanced Approach to Object Detection and Tracking in Robotics and Autonomous Vehicles Using YOLOv8 and LiDAR Data Fusion

End-to-End Visual Target Tracking in Multi-robot Systems Based on Deep Convolutional Neural Network

Combining Laser-Scanning Data and Images for Target Tracking and Scene Modeling

A Tracking-By-Detection Based 3D Multiple Object Tracking for Autonomous Driving

Target tracking based on heterogeneous sensor information fusion in intelligent space

A Method of Target Tracking Based on Monocular Vision for Mobile Robot in Unknown Environment

A Real-Time Tracking Algorithm for Multi-Target UAV Based on Deep Learning

Target Recognition and Location Based on Deep Learning

A Method for Designated Target Anti-Interference Tracking Combining YOLOv5 and SiamRPN for UAV Tracking and Landing Control

Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors

Online Multiple Targets Detection and Tracking from Mobile Robot in Cluttered Indoor Environments with Depth Camera

Robust Detection and Tracking Method for Moving Object Based on Radar and Camera Data Fusion

Target recognition and tracking system based on UAV platform