Data Augmentation in Human-Centric Vision

Wentao Jiang,Yige Zhang,Shaozhong Zheng,Si Liu,Shuicheng Yan
DOI: https://doi.org/10.1007/s44336-024-00002-9
2024-01-01
Abstract:This survey presents a comprehensive analysis of data augmentation techniquesin human-centric vision tasks, a first of its kind in the field. It delves intoa wide range of research areas including person ReID, human parsing, human poseestimation, and pedestrian detection, addressing the significant challengesposed by overfitting and limited training data in these domains. Our workcategorizes data augmentation methods into two main types: data generation anddata perturbation. Data generation covers techniques like graphic engine-basedgeneration, generative model-based generation, and data recombination, whiledata perturbation is divided into image-level and human-level perturbations.Each method is tailored to the unique requirements of human-centric tasks, withsome applicable across multiple areas. Our contributions include an extensiveliterature review, providing deep insights into the influence of theseaugmentation techniques in human-centric vision and highlighting the nuances ofeach method. We also discuss open issues and future directions, such as theintegration of advanced generative models like Latent Diffusion Models, forcreating more realistic and diverse training data. This survey not onlyencapsulates the current state of data augmentation in human-centric vision butalso charts a course for future research, aiming to develop more robust,accurate, and efficient human-centric vision systems.
What problem does this paper attempt to address?