Abstract: Acquiring a human body shape with high precision and evaluating the reconstructed models effectively remain challenging, because the results are easily affected by various factors (e.g., the performance of the capture device, unwanted movement of the subject, and self-occlusion of the articulated body structure). To tackle these challenges, this research presents a passive acquisition system comprising 60 spatially configured Digital Single Lens Reflex (DSLR) cameras and a carefully devised algorithmic pipeline for shape acquisition in a single shot. Unlike traditional multi-view stereo solutions, the constituent cameras are synchronized and organized into 30 binocular stereo rigs that capture images from multiple views simultaneously. Each binocular stereo rig is regarded as a depth sensor. The acquisition pipeline consists of three stages. First, camera calibration is performed to estimate the intrinsic and extrinsic parameters of all cameras, especially for the paired binocular cameras. Second, depth inference based on stereo matching is employed to recover reliable depth information from RGB images. A novel hierarchical seed-propagation stereo matching framework is proposed, yielding 30 dense and uniformly distributed partial point clouds. Finally, a point-based geometry processing step composed of multi-view registration and surface meshing is carried out to obtain high-quality watertight human body shapes. This research also proposes a novel method to assess the accuracy of reconstructed non-rigid human body models based on anthropometric parameters, which addresses the problem of synchronizing the ground-truth values with the measured values. Experimental results show that the system achieves a reconstruction accuracy within 2.5 mm on average. (C) 2020 Elsevier Ltd. All rights reserved.
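
As a rough illustration of why a calibrated binocular rig can be regarded as a depth sensor, the sketch below applies the standard triangulation relation for a rectified stereo pair, Z = f * B / d. It is not the hierarchical seed-propagation matcher proposed in the paper, which is not described here. The function name and the rig parameters (focal length, baseline, disparity) are hypothetical values chosen only for illustration.

```python
def disparity_to_depth(disparity_px, focal_px, baseline_mm):
    """Depth in mm of a point seen by a rectified stereo pair: Z = f * B / d.

    disparity_px: horizontal pixel offset of the point between the two views
    focal_px:     focal length in pixels (from intrinsic calibration)
    baseline_mm:  distance between the two camera centers (from extrinsics)
    """
    if disparity_px <= 0:
        raise ValueError("disparity must be positive for a finite depth")
    return focal_px * baseline_mm / disparity_px


# Hypothetical rig, not taken from the paper: 4000 px focal length, 100 mm baseline.
depth_mm = disparity_to_depth(disparity_px=200.0, focal_px=4000.0, baseline_mm=100.0)
print(depth_mm)
```

Because depth is inversely proportional to disparity, a fixed matching error in pixels produces a larger depth error for distant points, which is one reason accurate calibration of each paired rig matters.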

A Multi-View Skeleton Data Fusion Method Based on BP Neural Network

Human Pose Tracking Algorithm Based on Skeleton-Texture Model

A Skeleton and Visual Tracking Fusion Based Person-Following System for Mobile Service Robots

3D Articulated Skeleton Extraction Using a Single Consumer-Grade Depth Camera

Mmskeleton: 3D Human Skeleton Estimation Using Millimeter Wave Radar Sparse Point Clouds

Optimization of Human Posture Recognition based on Multi-view Skeleton Data Fusion

Multi-Kinects fusion for full-body tracking in virtual reality-aided assembly simulation

Dynamic Human Body Reconstruction and Motion Tracking with Low-Cost Depth Cameras

High-precision Human Body Acquisition Via Multi-View Binocular Stereopsis

Markerless 3D Skeleton Tracking Algorithm by Merging Multiple Inaccurate Skeleton Data from Multiple RGB-D Sensors

Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation

Parametric Human Body Reconstruction Based on Sparse Key Points

Constraint-Based Optimized Human Skeleton Extraction from Single-Depth Camera

Tracking Human-like Natural Motion Using Deep Recurrent Neural Networks

Multiple Kinect based system to monitor and analyze key performance indicators of physical training

Marker-Less 3D Human Motion Capture With Monocular Image Sequence And Height-Maps

Unsupervised Articulated Skeleton Extraction from Point Set Sequences Captured by a Single Depth Camera

Motion Projection Consistency Based 3D Human Pose Estimation with Virtual Bones from Monocular Videos

PointSkelCNN: Deep Learning-Based 3D Human Skeleton Extraction from Point Clouds

Human Motion Tracking by Multiple RGBD Cameras

Fusion Poser: 3D Human Pose Estimation Using Sparse IMUs and Head Trackers in Real Time