THUYG-20: A free Uyghur speech database

Rouzi Aisikaer, Shi YIN, Zhiyong ZHANG, Dong WANG, Hamdulla Askar, Fang ZHENG
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2017.22.012
2017-01-01
Abstract: Speech data plays a fundamental role in speech recognition research. However, few open speech databases are available to researchers in China, especially for minority languages such as Uyghur. This paper presents a Uyghur continuous speech database that is completely open and free. The database consists of 20 h of training speech and 1 h of test speech, together with all the resources needed to construct a full Uyghur speech recognition system, including a phone set, a lexicon, and text data. A recipe for constructing the baseline system is also described, with results on two test sets covering clean speech and noisy speech. The database is intended to serve as a standard benchmark for Uyghur speech recognition.