Abstract:Training an accurate 3D human pose estimator often requires a large amount of 3D ground-truth data which is inefficient and costly to collect. Previous methods have either resorted to weakly supervised methods to reduce the demand of ground-truth data for training, or using synthetically-generated but photo-realistic samples to enlarge the training data pool. Nevertheless, the former methods mainly require either additional supervision, such as unpaired 3D ground-truth data, or the camera parameters in multiview settings. On the other hand, the latter methods require accurately textured models, illumination configurations and background which need careful engineering. To address these problems, we propose a domain adaptation framework with unsupervised knowledge transfer, which aims at leveraging the knowledge in multi-modality data of the easy-to-get synthetic depth datasets to better train a pose estimator on the real-world datasets. Specifically, the framework first trains two pose estimators on synthetically-generated depth images and human body segmentation masks with full supervision, while jointly learning a human body segmentation module from the predicted 2D poses. Subsequently, the learned pose estimator and the segmentation module are applied to the real-world dataset to unsupervisedly learn a new RGB image based 2D/3D human pose estimator. Here, the knowledge encoded in the supervised learning modules are used to regularize a pose estimator without ground-truth annotations. Comprehensive experiments demonstrate significant improvements over weakly supervised methods when no ground-truth annotations are available. Further experiments with ground-truth annotations show that the proposed framework can outperform state-of-the-art fully supervised methods. In addition, we conducted ablation studies to examine the impact of each loss term, as well as with different amount of supervisions signal.

Unsupervised Domain Adaptation Approach for Vision-Based Semantic Understanding of Bridge Inspection Scenes Without Manual Annotations

Attention-based Cross-Layer Domain Alignment for Unsupervised Domain Adaptation

A New Bidirectional Unsupervised Domain Adaptation Segmentation Framework

Survey on Unsupervised Domain Adaptation for Semantic Segmentation for Visual Perception in Automated Driving

Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data

Unsupervised Domain Adaptation for 3D Human Pose Estimation

Domain Bridge for Unpaired Image-to-Image Translation and Unsupervised Domain Adaptation

A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models

Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer

Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection

Rethinking Unsupervised Domain Adaptation for Semantic Segmentation

An Unsupervised Domain Adaption Framework for Aerial Image Semantic Segmentation Based on Curriculum Learning

Deep Unsupervised Domain Adaptation: A Review of Recent Advances and Perspectives

Style Adaptation for Domain-adaptive Semantic Segmentation

Source-Free Domain Adaptation for Semantic Segmentation

A Fine-Grained Unsupervised Domain Adaptation Framework for Semantic Segmentation of Remote Sensing Images

IterDANet: Iterative Intra-Domain Adaptation for Semantic Segmentation of Remote Sensing Images

Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation Using Region and Category Adaptive Domain Discriminator

GrabDAE: An Innovative Framework for Unsupervised Domain Adaptation Utilizing Grab-Mask and Denoise Auto-Encoder

Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation

Unsupervised Domain Adaptation in Semantic Segmentation: A Review