Domain Adaptation with Clustered Language Models

Joerg P. Ueberla
DOI: https://doi.org/10.48550/arXiv.cmp-lg/9703001
1997-03-04
Abstract: In this paper, a method of domain adaptation for clustered language models is developed. It is based on a previously developed clustering algorithm, but with a modified optimisation criterion. The results are shown to be slightly superior to the previously published 'Fillup' method, which can be used to adapt standard n-gram models. However, the improvement both methods give compared to models built from scratch on the adaptation data is quite small (less than 11% relative improvement in word error rate). This suggests that both methods are still unsatisfactory from a practical point of view.
Computation and Language