Abstract: Acquiring a human body shape with high precision and evaluating the reconstructed models effectively remain challenging, because the results are easily affected by various factors (e.g., the performance of the capture device, unwanted movement of the subject, and self-occlusion of the articulated body structure). To tackle these challenges, this research presents a passive acquisition system comprising 60 spatially configured Digital Single Lens Reflex (DSLR) cameras and a carefully devised algorithmic pipeline for shape acquisition in a single shot. Unlike traditional multi-view stereo solutions, the constituent cameras are synchronized and organized into 30 binocular stereo rigs that capture images from multiple views simultaneously. Each binocular stereo rig is regarded as a depth sensor. The acquisition pipeline consists of three stages. First, camera calibration is performed to estimate the intrinsic and extrinsic parameters of all cameras, with particular attention to the paired binocular cameras. Second, depth inference based on stereo matching is employed to recover reliable depth information from RGB images. A novel hierarchical seed-propagation stereo matching framework is proposed, which yields 30 dense and uniformly distributed partial point clouds. Finally, a point-based geometry processing step composed of multi-view registration and surface meshing is carried out to obtain high-quality watertight human body shapes. This research also proposes a novel method for assessing the accuracy of reconstructed non-rigid human body models based on anthropometric parameters, which resolves the problem of synchronizing the ground-truth values with the measured values. Experimental results show that the system achieves a reconstruction accuracy within 2.5 mm on average. © 2020 Elsevier Ltd. All rights reserved.
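
To illustrate why each binocular rig can be regarded as a depth sensor, the following is a minimal sketch of the standard disparity-to-depth relation for a rectified stereo pair, Z = f * B / d. It is not the hierarchical seed-propagation framework proposed in the paper, whose details the abstract does not give. The function names and the numeric values (focal length, baseline, disparities) are hypothetical and chosen only for illustration.

```python
# Sketch only: textbook triangulation for a rectified stereo pair.
# All names and numbers here are illustrative assumptions, not the
# authors' implementation or calibration values.

def disparity_to_depth(disparity_px, focal_px, baseline_mm):
    """Return depth in mm for one pixel, or None if the disparity is invalid."""
    if disparity_px <= 0:
        # Zero or negative disparity means no usable match for this pixel.
        return None
    return focal_px * baseline_mm / disparity_px


def depth_map(disparities, focal_px, baseline_mm):
    """Apply the relation to a 2D grid (list of rows) of disparities."""
    return [
        [disparity_to_depth(d, focal_px, baseline_mm) for d in row]
        for row in disparities
    ]


if __name__ == "__main__":
    # Hypothetical rig: 5000 px focal length, 100 mm baseline.
    grid = [[50.0, 40.0], [0.0, 25.0]]
    for row in depth_map(grid, focal_px=5000.0, baseline_mm=100.0):
        print(row)
```

Because depth is inversely proportional to disparity, a fixed matching error in pixels produces a larger depth error for distant points, which is one reason matching quality matters for a millimetre-level accuracy target.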

Natural scenes reveal diverse representations of 2D and 3D body pose in the human brain

Complexity of mental geometry for 3D pose perception

A System View of the Recognition and Interpretation of Observed Human Shape, Pose and Action

LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies

Resolving 3D Human Pose Ambiguities with 3D Scene Constraints

Fine-grained neural coding of bodies and body parts in human visual cortex

High-precision Human Body Acquisition Via Multi-View Binocular Stereopsis

Representation of Contextually Related Multiple Objects in the Human Ventral Visual Pathway

Reconstructing 3D Human Pose from RGB-D Data with Occlusions

Approaching human 3D shape perception with neurally mappable models

Multivariate Analysis of BOLD Activation Patterns Recovers Graded Depth Representations in Human Visual and Parietal Cortex

Visualizing fMRI BOLD responses to diverse naturalistic scenes using retinotopic projection

Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans

Subject-Specific Human Modeling for Human Pose Estimation

Moving and Static Faces, Bodies, Objects, and Scenes Are Differentially Represented across the Three Visual Pathways

Real-time human pose recognition in parts from single depth images

Becoming sexy: Contrapposto pose increases attractiveness ratings and modulates observers' brain activity

Behaviorally-relevant features of observed actions dominate cortical representational geometry in natural vision

Implicit Neural Representations With Structured Latent Codes for Human Body Modeling

Naturalistic Object Representations Depend on Distance and Size Cues

3D Human Pose and Shape Estimation with Dense Correspondence from a Single Depth Image