CSLP CORPORA AND LANGUAGE RESOURCES

Hsiao-Chuan Wang, Thomas Fang Zheng, Jianhua Tao
DOI: https://doi.org/10.1142/9789812772961_0023
2006-01-01
Abstract: This chapter discusses the fundamental issues related to the development of language resources for Chinese spoken language processing (CSLP). Chinese dialects, transcription systems, and Chinese character sets are described. The general procedure for speech corpus production is introduced, along with the dialect-specific problems related to CSLP corpora. Some activities in the development of CSLP corpora are also presented here. Finally, available language resources for CSLP as well as their related websites are listed.