S‐LASSIE: Structure and smoothness enhanced learning from sparse image ensemble for 3D articulated shape reconstruction
Jingze Feng,Chong He,Guorui Wang,Meili Wang
DOI: https://doi.org/10.1002/cav.2277
IF: 1.01
2024-05-31
Computer Animation and Virtual Worlds
Abstract:This paper introduces S‐LASSIE, a novel method for reconstructing quadrupeds from monocular sparse images. Utilizing a multi‐angle projection loss function and a bone fusion smoothing technique, this approach enhances reconstruction results and ensures that the resulting mesh possesses a comprehensive topological structure. In computer vision, the task of 3D reconstruction from monocular sparse images poses significant challenges, particularly in the field of animal modelling. The diverse morphology of animals, their varied postures, and the variable conditions of image acquisition significantly complicate the task of accurately reconstructing their 3D shape and pose from a monocular image. To address these complexities, we propose S‐LASSIE, a novel technique for 3D reconstruction of quadrupeds from monocular sparse images. It requires only 10–30 images of similar breeds for training. To effectively mitigate depth ambiguities inherent in monocular reconstructions, S‐LASSIE employs a multi‐angle projection loss function. In addition, our approach, which involves fusion and smoothing of bone structures, resolves issues related to disjointed topological structures and uneven connections at junctions, resulting in 3D models with comprehensive topologies and improved visual fidelity. Our extensive experiments on the Pascal‐Part and LASSIE datasets demonstrate significant improvements in keypoint transfer, overall 2D IOU and visual quality, with an average keypoint transfer and overall 2D IOU of 59.6% and 86.3%, respectively, which are superior to existing techniques in the field.
computer science, software engineering