Speaker identification under mismatched speaking manner based on joint factor analysis

Qingfang Zhang,Heming Zhao,Xiaojiang Gu
2012-01-01
Abstract:In order to increase the recognition rate when trained mainly with normal speech but tested with whispered speech, the paper treats whispered speech and normal speech as two different channels, and introduces joint factor analysis into the recognition system. It estimates the speaker space and channel space, then uses speaker factor to train speaker model, and uses channel factor to eliminate the influence result from the mismatched speaking manner. The paper analyzes the recognition rate on the different numbers of speaker factors and channel factors and compares the results based on different background models. The experimental result shows that the joint factor analysis can greatly increase the recognition rate under the mismatched condition, and this method is practical.
What problem does this paper attempt to address?