DIOR: Dataset for Indoor-Outdoor Reidentification -- Long Range 3D/2D Skeleton Gait Collection Pipeline, Semi-Automated Gait Keypoint Labeling and Baseline Evaluation Methods

Yuyang Chen, Praveen Raj Masilamani, Bhavin Jawade, Srirangaraj Setlur, Karthik Dantu
2023-09-22
Abstract: In recent times, there has been increased interest in the identification and re-identification of people at long distances, such as from rooftop cameras, UAV cameras, street cams, and others. Such recognition needs to go beyond the face and use whole-body markers such as gait. However, datasets to train and test such recognition algorithms are not widely available, and fewer still are labeled. This paper introduces DIOR -- a framework for data collection and semi-automated annotation -- and also provides a dataset with 14 subjects and 1.649 million RGB frames with 3D/2D skeleton gait labels, including 200 thousand frames from a long-range camera. Our approach leverages advanced 3D computer vision techniques to attain pixel-level accuracy in indoor settings with motion capture systems. Additionally, for outdoor long-range settings, we remove the dependency on motion capture systems and adopt a low-cost, hybrid 3D computer vision and learning pipeline with only 4 low-cost RGB cameras, successfully achieving precise skeleton labeling on far-away subjects, even when their height is limited to a mere 20-25 pixels within an RGB frame. On publication, we will make our pipeline open for others to use.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper primarily addresses long-distance human gait recognition in both indoor and outdoor environments. Its contributions are as follows:

1. **Dataset**: A new dataset named DIOR is proposed, which includes gait data from 14 subjects, covers both indoor and outdoor environments, and includes long-distance (low-resolution) images. The dataset contains 1,649,918 frames, of which 802,726 are from an indoor motion capture (MoCap) setting, 847,193 are from outdoor environments, and 211,798 are long-distance images.
2. **Semi-Automatic Annotation Pipeline**: A semi-automatic annotation method is developed that annotates gait keypoints efficiently and accurately. In indoor environments, pixel-level accuracy of 2D gait keypoint annotation is achieved using a motion capture system and 3D computer vision techniques; in outdoor environments, low-cost RGB cameras combined with 3D computer vision techniques are used to accurately annotate long-distance, low-resolution images.
3. **Baseline Evaluation**: Various gait recognition algorithms are tested on the DIOR dataset to establish benchmark performance, providing a reference for future research.

Through these contributions, the paper aims to advance long-distance gait recognition, supporting fields such as perimeter security, public safety, and autonomous driving.
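
The summary above says that 2D keypoint labels are obtained from 3D data using 3D computer vision. One common way to turn a 3D keypoint into a 2D pixel label is to project it through a calibrated camera, and the sketch below illustrates that idea with an ideal pinhole model. It is not the authors' pipeline: the function name `project_points`, the camera intrinsics, the pose, and the subject distance are all hypothetical values chosen for illustration, and lens distortion is ignored.

```python
from typing import List, Sequence, Tuple

Point3 = Tuple[float, float, float]
Pixel = Tuple[float, float]


def project_points(
    points_world: Sequence[Point3],
    R: Sequence[Sequence[float]],  # 3x3 world-to-camera rotation (assumed known)
    t: Sequence[float],            # world-to-camera translation in metres (assumed known)
    fx: float, fy: float,          # focal lengths in pixels (hypothetical)
    cx: float, cy: float,          # principal point in pixels (hypothetical)
) -> List[Pixel]:
    """Project 3D points to pixel coordinates with an ideal pinhole camera."""
    pixels: List[Pixel] = []
    for X in points_world:
        # World frame -> camera frame: Xc = R @ X + t
        xc, yc, zc = [sum(R[i][j] * X[j] for j in range(3)) + t[i] for i in range(3)]
        if zc <= 0:
            raise ValueError("point is behind the camera")
        # Perspective division, then shift to the image origin
        pixels.append((fx * xc / zc + cx, fy * yc / zc + cy))
    return pixels


# Hypothetical setup: camera at the world origin looking along +z, y pointing down.
IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
head = (0.0, -0.85, 50.0)  # illustrative 1.7 m tall subject, 50 m away
feet = (0.0, 0.85, 50.0)

head_px, feet_px = project_points(
    [head, feet], IDENTITY, [0.0, 0.0, 0.0], fx=700.0, fy=700.0, cx=960.0, cy=540.0
)
height_px = feet_px[1] - head_px[1]
print(round(height_px, 1))  # 23.8
```

With these made-up numbers the subject spans about 24 pixels, the same order as the 20-25 pixel subject heights mentioned in the abstract, which shows why keypoint labels at long range need sub-pixel care. A real pipeline would also need camera calibration, time synchronization between sensors, and a distortion model.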