Self-supervised Auxiliary Learning for Texture and Model-based Hybrid Robust and Fair Featuring in Face Analysis

Shukesh Reddy,Nishit Poddar,Srijan Das,Abhijit Das
2024-09-29
Abstract:In this work, we explore Self-supervised Learning (SSL) as an auxiliary task to blend the texture-based local descriptors into feature modelling for efficient face analysis. Combining a primary task and a self-supervised auxiliary task is beneficial for robust representation. Therefore, we used the SSL task of mask auto-encoder (MAE) as an auxiliary task to reconstruct texture features such as local patterns along with the primary task for robust and unbiased face analysis. We experimented with our hypothesis on three major paradigms of face analysis: face attribute and face-based emotion analysis, and deepfake detection. Our experiment results exhibit that better feature representation can be gleaned from our proposed model for fair and bias-less face analysis.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to incorporate texture - based local descriptors into feature modeling through self - supervised learning (SSL) as an auxiliary task, in order to achieve efficient, robust and unbiased facial analysis. Specifically, the paper aims to: 1. **Fuse texture features and model features**: Combine texture - based local descriptors and model - based features to improve the performance of facial analysis tasks. 2. **Enhance the robustness and fairness of feature representation**: By introducing self - supervised learning tasks, especially in the case of limited data, improve the quality of feature representation, making it more robust and unbiased. 3. **Deal with complex changes**: Handle the impact of complex changes such as facial expressions, postures, and lighting on feature extraction. ### Main problems - **Robustness**: How to maintain the stability of facial features under different conditions (such as different lighting, angles, expressions, etc.). - **Fairness**: How to ensure that the facial analysis model performs consistently among different populations (such as gender, race, etc.) and avoid bias. - **Feature fusion**: How to effectively combine local texture features with global model features to improve overall performance. ### Solutions The author proposes a hybrid method that combines texture - based features and model - based features and utilizes self - supervised learning as an auxiliary task. Specific methods include: - **Masked Auto - Encoder (MAE)**: Used to reconstruct texture features, such as local patterns, to enhance the robustness of feature representation. - **Multi - task learning framework**: Combine the main task (such as classification) and self - supervised auxiliary tasks to optimize feature representation. - **Experimental verification**: Experiments were carried out on multiple facial analysis tasks, including facial attribute analysis, emotion analysis, and deepfake detection, verifying the effectiveness of the proposed method. Through these methods, the paper aims to provide a more robust and fairer facial analysis solution.