State-Time-Alignment Phone Clustering Based Language-independent Phone Recognizer Front-end for Phonotactic Language Recognition.

Wei-Wei Liu,Guo-Chun Li,Cun-Xue Zhang,Hai-Feng Yan,Jing He,Yan-Miao Song,Ying-Xin Gan,Jian-Zhong Liu,Ying Yin,Ya-Nan Li,Zhao Peng,Yu-Bin Huang,Xi-Bo Zhang,Jian-Feng Tong,Xing-Hua He,Fu-Qiang Yuan,Hui-Qi Tao,Bao-Zhu Zhao
DOI: https://doi.org/10.1109/iccse.2019.8845351
2019-01-01
Abstract:The now-acknowledged sensitive of Phonotactic Language Recognition (PLR) technology to the performance of the phone recognizer front-end have spawned interests to develop many methods to improve it. In this paper a state-of-art State-Time-Alignment (STA) phone clustering approach to build language-independent phone recognizer is proposed in phonotactic language recognition system to balance the performance and the complexity of the speech tokenizing processing in PLR.Experiments are carried out on the database of National Institute of Standards and Technology language recognition evaluation 2009 (NIST LRE 2009) and the experimental results have confirmed that phonotactic language recognition system using the collaborated language model yields 1.84%, 5.55% and 16.82% in equal error rate (EER), which show that the STA phone clustering based phone recognizer front-end outperforms the original English and Mandaren phone recognizers and other phone clustering methods based phone recognizer.
What problem does this paper attempt to address?