Vocal tract characteristic on long-term formant distribution

Xu Yixue, Kong Jiangping
DOI: https://doi.org/10.1109/ICCSNT.2012.6525922
2012-01-01
Abstract: Several approaches to forensic speaker identification have been proposed, based largely on formants, whose frequencies and dynamics partly reflect the individual vocal tract. Because speakers may now use other languages while committing a crime, a new question arises: how to identify a speaker across different languages. Given the same recording materials in three languages (Chinese, English and Korean), it is of great value to identify a specific speaker even when the language spoken is not his or her native language. This paper presents a new method for this case based on formant features and mathematical principles. The formant features, namely the emergence and number of peaks, kurtosis and skewness, which are the most significant among the F1, F2, F3 and F4 values, are extracted first in our experiment. By comparing the correlations between these features, computed from the long-term formant (LTF) distribution, the similarities and differences among the three languages spoken by the same speaker can be visualized.
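The features named in the abstract can be illustrated with a minimal sketch. It assumes the formant tracks have already been measured as lists of frequencies in Hz; the function names, the histogram bin count, and the use of a Pearson correlation are illustrative assumptions, not the authors' published procedure.

```python
import math
from statistics import mean


def skewness(values):
    """Population skewness: third central moment over m2 ** 1.5."""
    m = mean(values)
    n = len(values)
    m2 = sum((v - m) ** 2 for v in values) / n
    m3 = sum((v - m) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


def excess_kurtosis(values):
    """Population excess kurtosis: fourth central moment over m2 ** 2, minus 3."""
    m = mean(values)
    n = len(values)
    m2 = sum((v - m) ** 2 for v in values) / n
    m4 = sum((v - m) ** 4 for v in values) / n
    return m4 / m2 ** 2 - 3.0


def count_peaks(values, lo, hi, bins=20):
    """Count strict local maxima in a histogram of values over [lo, hi).

    The bin count is an assumed parameter; flat-topped peaks are not counted.
    """
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        if lo <= v < hi:
            # Clamp the index to guard against floating-point rounding.
            counts[min(int((v - lo) / width), bins - 1)] += 1
    padded = [0] + counts + [0]
    return sum(
        1
        for i in range(1, bins + 1)
        if padded[i] > padded[i - 1] and padded[i] > padded[i + 1]
    )


def ltf_features(values, lo, hi, bins=20):
    """Hypothetical feature vector for one formant: peaks, skewness, kurtosis."""
    return [count_peaks(values, lo, hi, bins), skewness(values), excess_kurtosis(values)]


def pearson(x, y):
    """Pearson correlation between two equal-length feature vectors."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)
```

Under these assumptions, one would call ltf_features on each of F1 to F4 for each language, concatenate the results into one vector per language, and compare the vectors of the same speaker pairwise with pearson.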