I-vector Clustering Dictionary and Attention Mechanism Framework for Speaker Adaptation

Jun HUANG,Bing JIANG,Xian-gang LI,Wu-sheng GUO,Li-rong DAI
DOI: https://doi.org/10.3969/j.issn.1000-1220.2019.02.038
2019-01-01
Abstract:Recently, speaker adaptation in speech recognition has been widely used in practical engineering. Using auxiliary input feature i-vector has been seen as one of the most effective ways in speaker adaptation. However, extracting i-vector needs all the data of each sentence, which can not be applied for online adaptation. Therefore, this paper proposes a newadaptive framework based on ivector clustering dictionary and attention mechanism, so as to realize the online adaptation without extracting i-vector and avoiding two decodes while testing. This framework has the advantages of excellent flexibility and expansibility, making it easy to be used for adaptation in other aspects, such as geographical and gender adaptation. We report experimental results on the Switchboard speech recognition task showing that the proposed framework outperforms the baseline on different acoustic models. In addition, the rationality of the proposed framework is further demonstrated by speaker recognition tasks.
What problem does this paper attempt to address?