COLLABORATED FEATURE SUPERVECTOR AND COLLABORATED LANGUAGE MODEL FOR PHONOTACTIC LANGUAGE RECOGNITION
Wei wei Liu,Guo Chun Li,Cun Xue Zhang,Tai Qing Dong,Jian Hua Zhou,Yan Miao Song,Jian Zhong Liu,Zhao Peng,Yu Bin Huang,Jian Feng Tong,Jing Fang Lu,Xing Hua He,Yu Jian Tang,Fu Qiang Yuan,Bin Guo,Lei Gao
DOI: https://doi.org/10.1504/ijhi.2019.106787
2019-01-01
International Journal of Hybrid Intelligence
Abstract:The now-acknowledged vulnerabilities of phonotactic language recognition (PLR) technology to long-term contexts have spawned interests to develop many methods to overcome it. In this paper an approach to build a collaborated feature supervector (CFS) built with binary decision tree feature and N-gram feature is proposed, and a collaborated language model (CLM) is introduced, which used to applied to deal with the problems of phonotactic language identification tasks, such as weak to handling long-term contexts and an exponential growth of the number of the parameters in the processing of modelling. Experiments are carried out on the database of National Institute of Standards and Technology Language Recognition Evaluation 2009 (NIST LRE 2009). The experimental results have confirmed that phonotactic language recognition system using the collaborated language model yields 1.07%, 2.68%, 13.48% in equal error rate (EER), which means 8.54%, 12.70% and 4.60% relative reduction for 30 s, 10 s, 3 s compared to the baseline system, respectively.