Abstract:Indoor environment is a common scene in our everyday life, and detecting and tracking multiple targets in this environment is a key component for many applications. However, this task still remains challenging due to limited space, intrinsic target appearance variation, e. g. full or partial occlusion, large pose deformation, and scale change. In the proposed approach, we give a novel framework for detection and tracking in indoor environments, and extend it to robot navigation. One of the key components of our approach is a virtual top view created from an RGB-D camera, which is named ground plane projection (GPP). The key advantage of using GPP is the fact that the intrinsic target appearance variation and extrinsic noise is far less likely to appear in GPP than in a regular side-view image. Moreover, it is a very simple task to determine free space in GPP without any appearance learning even from a moving camera. Hence GPP is very different from the top-view image obtained from a ceiling mounted camera. We perform both object detection and tracking in GPP. Two kinds of GPP images are utilized: gray GPP, which represents the maximal height of 3D points projecting to each pixel, and binary GPP, which is obtained by thresholding the gray GPP. For detection, a simple connected component labeling is used to detect footprints of targets in binary GPP. For tracking, a novel Pixel Level Association (PLA) strategy is proposed to link the same target in consecutive frames in gray GPP. It utilizes optical flow in gray GPP, which to our best knowledge has never been done before. Then we "back project" the detected and tracked objects in GPP to original, sideview (RGB) images. Hence we are able to detect and track objects in the side-view (RGB) images. Our system is able to robustly detect and track multiple moving targets in real time. The detection process does not rely on any target model, which means we do not need any training process. Moreover, tracking does not require any manual initialization, since all entering objects are robustly detected. We also extend the novel framework to robot navigation by tracking. As our experimental results demonstrate, our approach can achieve near prefect detection and tracking results. The performance gain in comparison to state-of-the-art trackers is most significant in the presence of occlusion and background clutter.

Multiple People Tracking for Mobile Robots in Extended Two-Dimensional Environment

Multi-modal 3D Human Tracking for Robots in Complex Environment with Siamese Point-Video Transformer

Accurate and Real-Time 3-D Tracking for the Following Robots by Fusing Vision and Ultrasonar Information

Human Tracking and Following in Dynamic Environment for Service Robots

Beyond Traditional Driving Scenes: A Robotic-Centric Paradigm for 2D+3D Human Tracking Using Siamese Transformer Network

People Following System Based on LRF

Real-time 3D Human Tracking for Mobile Robots with Multisensors

Fusing monocular vision and laser range data for human detection and tracking via multi-robot collaboration

A multi-camera approach to tracking and localization of people with coexisting robots

Reliable people tracking approach for mobile robot in indoor environments

Online Multiple Targets Detection and Tracking from Mobile Robot in Cluttered Indoor Environments with Depth Camera

Human tracking of robots based on double-layer cooperative locating mechanism

Multi-modal Tracking of People Using Laser Scanners and Video Camera

Real-Time Rgb-D Based People Detection And Tracking System For Mobile Robots

A Robust Person Tracking and Following Approach for Mobile Robot

A Method of Target Tracking Based on Monocular Vision for Mobile Robot in Unknown Environment

Person following of binocular robot by extracting multiple features

Sensor Integration for Person Tracking and Following with Mobile Robot

Tracking multiple people using laser and vision

People tracking and following using multiple RGB-D cameras

Human Following for Wheeled Robot with Monocular Pan-tilt Camera