Abstract:It remains challenging how to acquire a human body shape with high precision and evaluate the reconstructed models effectively, because the results can be easily affected by various factors (e.g., the performance of the capture device, the unwanted movement of the subject, and the self-occlusion of the articulated body structure). To tackle the above challenges, this research presents a passive acquisition system, which comprises 60 spatially-configured Digital Single Lens Reflex (DSLR) cameras and a carefully devised algorithmic pipeline for shape acquisition in a single shot. Different from traditional multi-view stereo solutions, the constituent cameras are synchronized and organized into 30 binocular stereo rigs to capture images from multiple views simultaneously. Each binocular stereo rig is regarded as a depth sensor. The acquisition pipeline consists of three stages. First, camera calibration is performed to estimate intrinsic and extrinsic parameters of all cameras, especially for paired binocular cameras. Second, depth inference based on stereo matching is employed to recover reliable depth information from RGB images. A novel hierarchical seed-propagation stereo matching framework is proposed, resulting in 30 dense and uniform-distributed partial point clouds. Finally, a point-based geometry processing step composed of multi-view registration and surface meshing is carried out to obtain high-quality watertight human body shapes. This research also proposes an elaborate and novel method to assess the accuracy of reconstructed non-rigid human body model based on anthropometry parameters, which solves the synchronization of the ground-truth values and the measured values. Experimental results show that the system can achieve the reconstruction accuracy within 2.5 mm in average. (C) 2020 Elsevier Ltd. All rights reserved.

Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras with Human Semantics

Dynamic Multi-Person Mesh Recovery From Uncalibrated Multi-View Cameras

CAMInterHand: Cooperative Attention for Multi-View Interactive Hand Pose and Mesh Reconstruction

Human Mesh Recovery from Arbitrary Multi-view Images

Marker-Less 3d Human Motion Capture With Monocular Image Sequence And Height-Maps

Dynamic Human Body Reconstruction and Motion Tracking with Low-Cost Depth Cameras

Multi-view Human Body Mesh Translator

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras

MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction

High-precision Human Body Acquisition Via Multi-View Binocular Stereopsis

MH‐HMR: Human mesh recovery from monocular images via multi‐hypothesis learning

Learning Local Recurrent Models for Human Mesh Recovery

Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation

Two-camera-based Human Motion Capture

Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos

Synergistic Global-space Camera and Human Reconstruction from Videos

Synergetic Reconstruction from 2D Pose and 3D Motion for Wide-Space Multi-Person Video Motion Capture in the Wild

Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras

Humans as Checkerboards: Calibrating Camera Motion Scale for World-Coordinate Human Mesh Recovery

PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos

UnstructuredFusion: Realtime 4D Geometry and Texture Reconstruction Using Commercial RGBD Cameras.