A Pair-Based Language Model for the Robust Lexical Analysis in Chinese Text-to-speech Synthesis.

Wu Liu, Dezhi Huang, Yuan Dong, Xinnian Mao, Haila Wang
DOI: https://doi.org/10.21437/interspeech.2007-529
2007-01-01
Abstract: This paper presents a robust method of lexical analysis for Chinese text-to-speech (TTS) synthesis using a pair-based language model (LM). The traditional approach to Chinese lexical analysis treats word segmentation and part-of-speech (POS) tagging as two separate phases, each with its own algorithms and models. In fact, POS information is useful for word segmentation, and vice versa. Therefore, a pair-based language model is proposed to integrate basic word segmentation, POS tagging, and named entity (NE) identification into a unified framework. The objective evaluation indicates that the proposed method achieves top-level performance and confirms its effectiveness in Chinese lexical analysis.