RNN Language Model with Word Clustering and Class-Based Output Layer

Yongzhe Shi,Wei-Qiang Zhang,Jia Liu,Michael T Johnson
DOI: https://doi.org/10.1186/1687-4722-2013-22
2013-01-01
Abstract:The recurrent neural network language model (RNNLM) has shown significant promise for statistical language modeling. In this work, a new class-based output layer method is introduced to further improve the RNNLM. In this method, word class information is incorporated into the output layer by utilizing the Brown clustering algorithm to estimate a class-based language model. Experimental results show that the new output layer with word clustering not only improves the convergence obviously but also reduces the perplexity and word error rate in large vocabulary continuous speech recognition.
What problem does this paper attempt to address?