Automatic multi-speaker speech recognition system based on time-frequency blind source separation under ubiquitous environment

Zhe Wang,Haijian Zhang,Guoan Bi,Xiumei Li
DOI: https://doi.org/10.1109/ICIEA.2014.6931139
2014-01-01
Abstract:In this paper, an automatic speech recognition (ASR) system under ubiquitous environment is proposed, which is successfully implemented in a personalized voice command system under vehicle and living room environment. The proposed ASR system describes a novel scheme of separating speech sources from multi-speakers, detecting speech presence/absence by tracking the higher portion of speech power spectrum and adaptively suppressing noises. An automatic recognition algorithm to adapt with the multi-speaker task is designed and conducted. Evaluation tests are carried out using noise database NOISEX-92 and speech database YOHO Corpus. Experimental results show that the proposed algorithm manages to achieve very impressive improvements.
What problem does this paper attempt to address?