Learning from Longitudinal Face Demonstration - Where Tractable Deep Modeling Meets Inverse Reinforcement Learning

Chi Nhan Duong, Kha Gia Quach, Khoa Luu, T. Hoang Ngan Le, Marios Savvides, Tien D. Bui
DOI: https://doi.org/10.48550/arXiv.1711.10520
2019-02-03
Abstract: This paper presents a novel Subject-dependent Deep Aging Path (SDAP), which inherits the merits of both Generative Probabilistic Modeling and Inverse Reinforcement Learning to model the facial structures and the longitudinal face aging process of a given subject. The proposed SDAP is optimized using tractable log-likelihood objective functions with Convolutional Neural Networks (CNNs) based deep feature extraction. Instead of applying a fixed aging development path for all input faces and subjects, SDAP is able to provide the most appropriate aging development path for individual subject that optimizes the reward aging formulation. Unlike previous methods that can take only one image as the input, SDAP further allows multiple images as inputs, i.e. all information of a subject at either the same or different ages, to produce the optimal aging path for the given subject. Finally, SDAP allows efficiently synthesizing in-the-wild aging faces. The proposed model is experimented in both tasks of face aging synthesis and cross-age face verification. The experimental results consistently show SDAP achieves the state-of-the-art performance on numerous face aging databases, i.e. FG-NET, MORPH, AginG Faces in the Wild (AGFW), and Cross-Age Celebrity Dataset (CACD). Furthermore, we also evaluate the performance of SDAP on large-scale Megaface challenge to demonstrate the advantages of the proposed solution.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses how to find the most appropriate aging development path for each individual subject in face aging. According to the paper, existing face aging methods face the following challenges:

1. **Non-linearity**: Human aging is a complex, highly non-linear process, so traditional linear models (such as prototype-based, AAM-based, and 3DMM-based methods) cannot adequately explain aging changes, and the quality of their synthesized results is limited.
2. **Loss function**: Using a fixed reconstruction loss (such as the ℓ2 norm) in deep architectures usually produces blurry synthesis results.
3. **Manageability**: Probabilistic graphical models have introduced potential directions for the design of deep models and produced notable synthesis results in age progression, but data availability and fixed aging paths remain problems.
4. **Data utilization**: Even when a subject has multiple photos at the same age in the training or testing set, these methods use only one photo for learning or synthesis, and the remaining photos are usually wasted. Moreover, the aging transformation embedding in these methods can only be learned between photos of two age groups.
5. **Fixed aging development path**: The learned aging development path is applied uniformly to all subjects, which is unrealistic. Each subject should have their own aging development path.

To overcome these challenges, the paper proposes the Subject-dependent Deep Aging Path (SDAP) model, which combines the advantages of generative probabilistic modeling and inverse reinforcement learning to discover the optimal aging development path for each individual. Its specific contributions are as follows:

1. **Aging transformation embedding**: A tractable log-likelihood density estimate is designed and combined with a Convolutional Neural Network (CNN) structure and an age controller that indicates how much aging change should be added during synthesis. In this way, SDAP can produce smoother synthesis results and make fuller use of the available aging data.
2. **Individualized aging development path**: Unlike most previous methods, SDAP can discover the optimal aging development path for an individual. This is achieved by embedding the transformation over the entire aging sequence in an inverse reinforcement learning framework.
3. **Automatically optimized objective function**: Unlike the predefined or ad hoc aging rewards and objective functions in most previous work, the proposed method lets the algorithm determine the optimal objective function and parameters during training through a data-driven strategy.

In conclusion, the paper aims to provide a more flexible and personalized approach to face aging by combining generative probabilistic modeling and inverse reinforcement learning, thereby improving the quality and practicality of the synthesized results.
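
The loss-function challenge described above (a fixed ℓ2 reconstruction loss tends to give blurry results) can be illustrated with a toy example. The following is a minimal sketch, not code from the paper: it assumes a 1-D "image" with two equally plausible aged outcomes, and all names (`l2_optimal_prediction`, `expected_l2_loss`, `edge_strength`) are hypothetical helpers introduced only for illustration. It relies on the standard fact that the single prediction minimizing the mean squared error over a set of targets is their element-wise mean.

```python
import numpy as np


def l2_optimal_prediction(candidates: np.ndarray) -> np.ndarray:
    """Prediction minimizing the mean squared error over all candidates.

    For squared error this minimizer is the element-wise mean.
    """
    return candidates.mean(axis=0)


def expected_l2_loss(prediction: np.ndarray, candidates: np.ndarray) -> float:
    """Average squared l2 distance from the prediction to each candidate."""
    return float(np.mean(np.sum((candidates - prediction) ** 2, axis=1)))


def edge_strength(signal: np.ndarray) -> float:
    """Largest jump between neighbouring values (a crude sharpness measure)."""
    return float(np.max(np.abs(np.diff(signal))))


# Two equally plausible outcomes (assumed toy data): the same sharp edge,
# e.g. a wrinkle boundary, located at two different positions.
outcome_a = np.array([0, 0, 0, 1, 1, 1, 1, 1], dtype=float)
outcome_b = np.array([0, 0, 0, 0, 0, 1, 1, 1], dtype=float)
candidates = np.stack([outcome_a, outcome_b])

blurred = l2_optimal_prediction(candidates)

print("sharp outcome edge strength:", edge_strength(outcome_a))
print("l2-optimal edge strength:   ", edge_strength(blurred))
print("loss of sharp outcome:      ", expected_l2_loss(outcome_a, candidates))
print("loss of l2-optimal output:  ", expected_l2_loss(blurred, candidates))
```

In this toy setting the averaged output has a lower expected ℓ2 loss than either sharp outcome, yet its edge is softer than both. This is one common explanation for why fixed ℓ2 objectives favor blurry faces, and it suggests why a learned, data-driven objective such as the one SDAP proposes could be preferable.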