Abstract: We use Gaussian Mixture Models to approximate the confidence maps analytically. We also use Gaussian Mixture Models to model the distribution of face shape. The posterior is maximized by iteratively maximizing a lower bound of it. High-order geometric constraints among facial features are considered. A model-based Hough voting scheme is proposed to estimate an initial face shape.

Detecting predefined facial feature points in a human face image is a well-studied problem. Despite the impressive progress that has been made, it remains open in unconstrained environments, with variations in illumination, expression, and head pose, as well as partial occlusions. This paper proposes a novel method to locate facial feature points under these variations. Support vector machines with probability outputs are trained to provide the observation probability of each facial feature point. The observation probabilities, as well as the distribution of face shape, which serves as the prior constraining the relative positions of the facial feature points, are both approximated with Gaussian Mixture Models. The problem is solved by maximizing the posterior, which combines the prior and the observation probability within the framework of Bayesian inference. An optimization algorithm is developed to maximize the posterior by iteratively maximizing a lower bound of it. The proposed method preserves the high-order geometric constraints among facial feature points. With a simple initialization by model-based Hough voting, the method shows competitive detection rate and localization accuracy on the LFPW and LFW datasets compared with state-of-the-art methods.
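
The abstract does not spell out the optimization algorithm, so the following is only a minimal sketch of the general idea: when both the prior and the observation probability are Gaussian mixtures, the log-posterior can be bounded from below with Jensen's inequality (an EM-style bound), and that bound is quadratic in the unknown position, so each iteration has a closed-form maximizer. The sketch treats a single 2-D point rather than a full face shape, and all function names and mixture parameters are hypothetical, chosen for illustration only.

```python
import numpy as np

def component_logs(x, weights, means, covs):
    """Log of w_k * N(x; mu_k, S_k) for every mixture component."""
    out = []
    for w, mu, cov in zip(weights, means, covs):
        d = x - mu
        _, logdet = np.linalg.slogdet(cov)
        quad = d @ np.linalg.solve(cov, d)
        out.append(np.log(w) - 0.5 * (quad + logdet + len(x) * np.log(2 * np.pi)))
    return np.array(out)

def log_gmm(x, gmm):
    """Log-density of a Gaussian mixture, computed with log-sum-exp."""
    logs = component_logs(x, *gmm)
    m = logs.max()
    return m + np.log(np.exp(logs - m).sum())

def log_posterior(x, prior, obs):
    """Unnormalized log-posterior: log prior(x) + log observation(x)."""
    return log_gmm(x, prior) + log_gmm(x, obs)

def bound_step(x, prior, obs):
    """Maximize the Jensen lower bound built at the current x.

    Responsibilities are fixed at x; the bound is then a sum of
    weighted quadratics, whose maximizer solves a linear system.
    """
    dim = len(x)
    A = np.zeros((dim, dim))
    b = np.zeros(dim)
    for gmm in (prior, obs):
        logs = component_logs(x, *gmm)
        r = np.exp(logs - logs.max())
        r /= r.sum()
        for rk, mu, cov in zip(r, gmm[1], gmm[2]):
            prec = np.linalg.inv(cov)
            A += rk * prec
            b += rk * (prec @ mu)
    return np.linalg.solve(A, b)

def maximize_posterior(x0, prior, obs, iters=100, tol=1e-10):
    """Iterate bound maximization; returns the estimate and the log-posterior trace."""
    x = np.asarray(x0, dtype=float)
    trace = [log_posterior(x, prior, obs)]
    for _ in range(iters):
        x_new = bound_step(x, prior, obs)
        trace.append(log_posterior(x_new, prior, obs))
        converged = np.linalg.norm(x_new - x) < tol
        x = x_new
        if converged:
            break
    return x, trace

# Toy mixtures (weights, means, covariances); the values are made up.
eye = np.eye(2)
prior = ([0.6, 0.4], [np.array([0.0, 0.0]), np.array([4.0, 4.0])], [eye, eye])
obs = ([0.5, 0.5], [np.array([1.0, 0.0]), np.array([5.0, 5.0])], [0.5 * eye, 0.5 * eye])

x_hat, trace = maximize_posterior([2.0, 2.0], prior, obs)
print("estimate:", x_hat, "iterations:", len(trace) - 1)
```

Because each step maximizes a bound that touches the log-posterior at the current point, the trace never decreases. The iteration converges to a local maximum only, which is presumably why the abstract pairs the optimizer with a separate initialization step.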

Voice activity detection based on sequential Gaussian mixture model with maximum likelihood criterion

Applying Support Vector Machines to Voice Activity Detection

Facial feature points detecting based on Gaussian Mixture Models

A robust voice activity detector based on Weibull and Gaussian Mixture distribution

Improved voice activity detection based on statistical likelihood ratio test

An efficient voice activity detection algorithm by combining statistical model and energy detection

Voice Activity Detection Based on Conjugate Subspace Matching Pursuit and Likelihood Ratio Test

Discriminative Dynamic Gaussian Mixture Selection with Enhanced Robustness and Performance for Multi-Accent Speech Recognition

Speaker Identification based on LSP and Gaussian Mixture Model

Automatic Audio Classification Based on EMGD_HMM

A genetic classification method for speaker recognition

Boosted Mixture Learning of Gaussian Mixture Hidden Markov Models Based on Maximum Likelihood for Speech Recognition

Maximum Probability Increase Estimation Method for Fast Gaussian Likelihood Computations

Boosted Mixture Learning of Gaussian Mixture HMMs for Speech Recognition

Structural Risk Minimization Principle Based Gaussian Mixture Modeling

GMM-based Voice Conversion with Explicit Modelling on Feature Transform

Multi-task Joint-Learning for Robust Voice Activity Detection

Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test

Multimodal Voice Activity Detection

A Modified MAP Criterion Based on Hidden Markov Model for Voice Activity Detection

Identification of Objectionable Audio Segments Based on Pseudo and Heterogeneous Mixture Models