Abstract: This paper presents a new speaker adaptation method, codebook-based speaker adaptation, which combines the advantages of transformation-based methods with those of Bayesian adaptive learning. The resulting system improves its performance when only a small amount of adaptation data is available, and it asymptotically approaches matched-condition performance as the amount of adaptation data increases. The adaptation process is divided into two stages. In the first stage, the acoustic parameters of the target speaker are approximated by a linear combination of the codebooks of many reference speakers. An effective algorithm based on Rosen's gradient projection method is developed to compute the weight of each codebook in the linear combination. In the second stage, the combined codebook serves as the prior, and Bayesian adaptive learning refines the estimate of the target speaker's codebook as more adaptation data are gathered. Incremental speaker adaptation is thus achieved. As an illustration, the method is applied to a speaker-independent continuous speech recognition system for Chinese. A series of comparative experiments was conducted to evaluate the proposed method, and the results show that it is promising.
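The two stages can be illustrated with a minimal sketch. It is not the paper's implementation, and several details are assumptions made for illustration only: the weights are assumed to be non-negative and to sum to one, each adaptation frame is assumed to be already assigned to a codeword (so only per-codeword sample means and counts are needed), the weights are fitted by a plain projected-gradient loop with a simplex projection as a simpler stand-in for Rosen's gradient projection method, and the second stage is a conjugate-prior mean update with a hypothetical prior strength `tau`. All function names are hypothetical.

```python
def project_to_simplex(v):
    """Euclidean projection of v onto {w : w_i >= 0, sum(w) = 1} (assumed constraint set)."""
    u = sorted(v, reverse=True)
    running, theta = 0.0, 0.0
    for i, ui in enumerate(u, start=1):
        running += ui
        t = (running - 1.0) / i
        if ui - t > 0:
            theta = t  # keep the threshold from the last index that satisfies the condition
    return [max(x - theta, 0.0) for x in v]


def combine(codebooks, w):
    """Weighted sum of reference codebooks; codebooks[r][k][d] is dim d of codeword k of speaker r."""
    n_ref, n_code, dim = len(codebooks), len(codebooks[0]), len(codebooks[0][0])
    return [[sum(w[r] * codebooks[r][k][d] for r in range(n_ref))
             for d in range(dim)] for k in range(n_code)]


def fit_weights(codebooks, means, counts, steps=500):
    """Stage 1 (stand-in): projected gradient on the count-weighted squared error
    between the adaptation-data means and the combined codebook."""
    n_ref = len(codebooks)
    # Conservative step size: 1 / trace(Hessian), which never exceeds 1 / largest eigenvalue.
    trace = sum(n * sum(c * c for c in codebooks[r][k])
                for r in range(n_ref) for k, n in enumerate(counts))
    step = 1.0 / trace if trace > 0 else 0.0
    w = [1.0 / n_ref] * n_ref  # start from uniform weights
    for _ in range(steps):
        est = combine(codebooks, w)
        grad = [sum(n * sum((e - m) * c
                            for e, m, c in zip(est[k], means[k], codebooks[r][k]))
                    for k, n in enumerate(counts))
                for r in range(n_ref)]
        w = project_to_simplex([wi - step * g for wi, g in zip(w, grad)])
    return w


def map_update(prior, means, counts, tau=10.0):
    """Stage 2 (assumed form): interpolate each prior codeword with the sample mean.
    Codewords with no data keep the prior; with many frames the data dominates."""
    return [[(tau * p + n * m) / (tau + n) for p, m in zip(pk, mk)]
            for pk, mk, n in zip(prior, means, counts)]
```

In this sketch, a caller would first compute `w = fit_weights(codebooks, means, counts)`, then build the prior with `combine(codebooks, w)`, and finally call `map_update` each time new adaptation data arrive. The interpolation in `map_update` is what gives the incremental behaviour described in the abstract: with few frames the estimate stays close to the combined codebook, and with many frames it converges to the speaker's own sample means.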

Search And Classification Based Language Model Adaptation

Language model adaptation based on correction information for interactive speech transcription

Agmma: A Novel Incremental Adaptation Method And Its Application To Speaker Recognition

A New Topic-Based Language Model Adaptation

Just-in-time Latent Semantic Adaptation on Language Model for Chinese Speech Recognition Using Web Data

A Language Model Adaptation Approach Based on Text Classification

Language Model Adaptation Based on the Classification of a Trigram's Language Style Feature

An Active Learning Approach to Task Adaptation

Codebook-Based Speaker Adaptation

Label Transform Based Cross-Language Speaker Adaptation in Bilingual (Mandarin-English) TTS

Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR

Attention-Guided Adaptation for Code-Switching Speech Recognition

An Improved Cross-Language Model Adaptation Method for Speech Synthesis

A Public Chinese Dataset for Language Model Adaptation

Speaker adaptation using maximum likelihood model interpolation

Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis

An Online Incremental Language Model Adaptation Method

Phoneme Dependent Speaker Embedding And Model Factorization For Multi-Speaker Speech Synthesis And Adaptation

Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information

Speech Recognition Using Speaker Adaptation by System Parameter Transformation

Dynamic Speaker Selected Training for Rapid Speaker Adaptation