Weakly Supervised Pretraining and Multi-Annotator Supervised Finetuning for Facial Wrinkle Detection

Ik Jun Moon,Junho Moon,Ikbeom Jang
2024-08-19
Abstract:1. Research question: With the growing interest in skin diseases and skin aesthetics, the ability to predict facial wrinkles is becoming increasingly important. This study aims to evaluate whether a computational model, convolutional neural networks (CNN), can be trained for automated facial wrinkle segmentation. 2. Findings: Our study presents an effective technique for integrating data from multiple annotators and illustrates that transfer learning can enhance performance, resulting in dependable segmentation of facial wrinkles. 3. Meaning: This approach automates intricate and time-consuming tasks of wrinkle analysis with a deep learning framework. It could be used to facilitate skin treatments and diagnostics.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: with the increasing attention to skin diseases and skin aesthetics, the ability to predict facial wrinkles is becoming more and more important. Specifically, the research aims to evaluate whether automated facial wrinkle segmentation can be achieved using computational models such as convolutional neural networks (CNN). ### Research Background and Problems 1. **Importance of Facial Wrinkles** - Facial wrinkles are an important sign of aging and play an important role in skin condition assessment, skin care, and early diagnosis of skin diseases. - Manual analysis of facial wrinkles in a large number of images is time - consuming and labor - intensive, and the subjectivity in the manual segmentation process will reduce the reliability of research results. 2. **Limitations of Existing Methods** - Traditional methods rely on manual annotation, which not only consumes a large amount of time and resources but also has problems of subjectivity and inconsistency. - The insufficiency of dataset annotation limits the performance improvement of deep - learning models. ### Research Objectives To solve the above problems, this paper proposes a deep - learning - based method to achieve automated facial wrinkle segmentation through the following means: - **Weakly - Supervised Pretraining**: Use texture maps to generate preliminary segmentation results and reduce the dependence on large - scale annotated data. - **Multi - Annotator - Supervised Fine - Tuning**: Combine the annotation data of multiple annotators, reduce the subjectivity of annotation through the majority - voting mechanism, and improve the accuracy of segmentation. - **Transfer Learning**: Use the knowledge of pre - trained models to fine - tune on limited annotated data, thereby improving the performance of the model. ### Solutions 1. **Dataset Selection and Processing** - Use the FFHQ dataset, which contains 70,000 high - quality face images. - Extract texture maps from these images as labels for weakly - supervised pretraining, and perform mask processing on the facial area through the BiSeNet architecture to generate the final texture mask. 2. **Model Training Process** - **Weakly - Supervised Pretraining**: Use the U - Net architecture, input RGB images, output texture masks, and train for 300 epochs. - **Supervised Fine - Tuning**: Use manually - annotated wrinkle masks for fine - tuning and train for 150 epochs. To evaluate the effect of transfer learning, experiments are carried out using 100%, 50%, 25%, and 5% of the data respectively. 3. **Evaluation Metrics** - Use the Jaccard Similarity Index (JSI) as an evaluation metric, and the formula is as follows: \[ J S I=\frac{|A \cap B|}{|A \cup B|} \] where \(A\) is the predicted segmentation result and \(B\) is the actual label. ### Results and Discussion - The experimental results show that, compared with the model trained only with manually - annotated data, the method proposed in this paper shows better performance under different data proportions, especially when the amount of data is small, the performance improvement is particularly significant. - Compared with the self - supervised - learning pretraining method, the method in this paper also shows higher accuracy. In conclusion, this research effectively improves the automation level and accuracy of facial wrinkle segmentation through the combination of weakly - supervised pretraining and multi - annotator - supervised fine - tuning, providing strong support for skin treatment and diagnosis.