A Improved Speech Synthesis System Utilizing BPSO-based Lip Feature Selection.

Mengjun Wang,Xiangling Wang,Gang Li
DOI: https://doi.org/10.1109/bmei.2011.6098551
2011-01-01
Abstract:To get a higher lipreading recognition result in speech synthesis system driven by visual speech, Binary Particle Swarm Optimization (BPSO) algorithms is used to select the “optimal” lip feature subset. Experiments are carried out based on HMM with 4 states and 16 Gaussian mixture components in a small database for speaker-dependent case. Experiment results show that the integrated discriminate vector after feature selection obtained the information from the geometrical features and the pixel based features. Comparing with feature fusion based on concatenating, the recognition rates with feature selection based on BPSO are improved by as much as 2.42%.
What problem does this paper attempt to address?