Abstract:It remains challenging how to acquire a human body shape with high precision and evaluate the reconstructed models effectively, because the results can be easily affected by various factors (e.g., the performance of the capture device, the unwanted movement of the subject, and the self-occlusion of the articulated body structure). To tackle the above challenges, this research presents a passive acquisition system, which comprises 60 spatially-configured Digital Single Lens Reflex (DSLR) cameras and a carefully devised algorithmic pipeline for shape acquisition in a single shot. Different from traditional multi-view stereo solutions, the constituent cameras are synchronized and organized into 30 binocular stereo rigs to capture images from multiple views simultaneously. Each binocular stereo rig is regarded as a depth sensor. The acquisition pipeline consists of three stages. First, camera calibration is performed to estimate intrinsic and extrinsic parameters of all cameras, especially for paired binocular cameras. Second, depth inference based on stereo matching is employed to recover reliable depth information from RGB images. A novel hierarchical seed-propagation stereo matching framework is proposed, resulting in 30 dense and uniform-distributed partial point clouds. Finally, a point-based geometry processing step composed of multi-view registration and surface meshing is carried out to obtain high-quality watertight human body shapes. This research also proposes an elaborate and novel method to assess the accuracy of reconstructed non-rigid human body model based on anthropometry parameters, which solves the synchronization of the ground-truth values and the measured values. Experimental results show that the system can achieve the reconstruction accuracy within 2.5 mm in average. (C) 2020 Elsevier Ltd. All rights reserved.

Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras

Full-body Motion Capture for Multiple Closely Interacting Persons.

Shape and Pose Estimation for Closely Interacting Persons Using Multi-view Images.

4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras

Marker-Less 3d Human Motion Capture With Monocular Image Sequence And Height-Maps

Outdoor Markerless Motion Capture with Sparse Handheld Video Cameras

Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera

DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras

Two-camera-based Human Motion Capture

High-precision Human Body Acquisition Via Multi-View Binocular Stereopsis

Dynamic Human Body Reconstruction and Motion Tracking with Low-Cost Depth Cameras

Accurate realtime full-body motion capture using a single depth camera

HybridFusion: Real-Time Performance Capture Using a Single Depth Sensor and Sparse IMUs

Dynamic Multi-Person Mesh Recovery From Uncalibrated Multi-View Cameras

Reconstructing Close Human Interactions from Multiple Views

Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation

Markerless motion capture of interacting characters using multi-view image segmentation

Multi-person Multi-Camera Tracking for Live Stream Videos Based on Improved Motion Model and Matching Cascade

Real-time Physics-based Motion Capture with Sparse Sensors

Markerless motion capture of multiple characters using multiview image segmentation

Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization