DIOR: Dataset for Indoor-Outdoor Reidentification -- Long Range 3D/2D Skeleton Gait Collection Pipeline, Semi-Automated Gait Keypoint Labeling and Baseline Evaluation Methods

Yuyang Chen,Praveen Raj Masilamani,Bhavin Jawade,Srirangaraj Setlur,Karthik Dantu

2023-09-22

Abstract:In recent times, there is an increased interest in the identification and re-identification of people at long distances, such as from rooftop cameras, UAV cameras, street cams, and others. Such recognition needs to go beyond face and use whole-body markers such as gait. However, datasets to train and test such recognition algorithms are not widely prevalent, and fewer are labeled. This paper introduces DIOR -- a framework for data collection, semi-automated annotation, and also provides a dataset with 14 subjects and 1.649 million RGB frames with 3D/2D skeleton gait labels, including 200 thousands frames from a long range camera. Our approach leverages advanced 3D computer vision techniques to attain pixel-level accuracy in indoor settings with motion capture systems. Additionally, for outdoor long-range settings, we remove the dependency on motion capture systems and adopt a low-cost, hybrid 3D computer vision and learning pipeline with only 4 low-cost RGB cameras, successfully achieving precise skeleton labeling on far-away subjects, even when their height is limited to a mere 20-25 pixels within an RGB frame. On publication, we will make our pipeline open for others to use.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper primarily addresses the problem of long-distance human gait recognition in both indoor and outdoor environments. Specifically, the contributions of the paper are as follows: 1. **Dataset**: A new dataset named DIOR is proposed, which includes gait data from 14 subjects, covering both indoor and outdoor environments, and includes long-distance (low-resolution) images. The dataset contains 1,649,918 frames of images, of which 802,726 frames are from an indoor motion capture system (MoCap), 847,193 frames are from outdoor environments, and 211,798 frames are long-distance images. 2. **Semi-Automatic Annotation Pipeline**: A semi-automatic annotation method is developed, which can efficiently and accurately annotate gait key points. In indoor environments, pixel-level accuracy of 2D gait key point annotation is achieved using a motion capture system and 3D computer vision technology; in outdoor environments, low-cost RGB cameras combined with 3D computer vision technology are used to achieve accurate annotation of long-distance low-resolution images. 3. **Baseline Evaluation**: Various gait recognition algorithms are tested on the DIOR dataset to establish benchmark performance, providing a reference for future research. Through these contributions, the paper aims to advance long-distance gait detection technology, providing technical support for fields such as perimeter security, public safety, and autonomous driving.

DIOR: Dataset for Indoor-Outdoor Reidentification -- Long Range 3D/2D Skeleton Gait Collection Pipeline, Semi-Automated Gait Keypoint Labeling and Baseline Evaluation Methods

Gait Recognition with Drones: A Benchmark

A Self-Supervised Gait Encoding Approach with Locality-Awareness for 3D Skeleton Based Person Re-Identification

A Benchmark for Gait Recognition under Occlusion Collected by Multi-Kinect SDAS

Semi-Supervised Domain Generalizable Person Re-Identification

CASIA-E: A Large Comprehensive Dataset for Gait Recognition

High-precision Human Body Acquisition Via Multi-View Binocular Stereopsis

Gait Recognition in Large-scale Free Environment via Single LiDAR

Distillation-guided Representation Learning for Unconstrained Gait Recognition

Human Body Pose Estimation for Gait Identification: A Comprehensive Survey of Datasets and Models

Multimodal People Re-Identification Using 3D Skeleton, Depth, and Color Information

Towards Generalization of 3D Human Pose Estimation In The Wild

Multi-Modal Human Authentication Using Silhouettes, Gait and RGB

LiDAR-based Person Re-identification

Multi-person 3D pose estimation from unlabelled data

BEHAVE: Dataset and Method for Tracking Human Object Interactions

LWIRPOSE: A novel LWIR Thermal Image Dataset and Benchmark

A Large-Scale Synthetic Gait Dataset Towards In-the-wild Simulation and Comparison Study.

Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Applications

Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments

Automatic gait recognition from a distance