Which phonemes will distinguish the different regions within the same dialect?

Xuefei Liu,Jianhua Tao,Yurong Han,Chenglong Wang,Xueying Zheng,Zhengqi Wen
DOI: https://doi.org/10.1109/O-COCOSDA202152914.2021.9660466
2021-01-01
Abstract:The work of finding which phonemes distinguish the different regions within the same dialect will be of great significance to the improvement the recognition technology, national and information security and dialect protection. To address this issue, this paper investigates the refinement recognition of different regions of the same dialect based on the corpus designed and recorded by CASIA (CASIA Dialect Corpus, CASIA DC) firstly and then finds the distinguishing phonemes by the probabilistic accumulation of phonemes method. Based on i-vector model, the recognition results indicate that the recognition rates of different regions of the same dialect are different from one dialect to another. The recognition rate of the Mandarin dialects is lower than that of other dialects. Through the probabilistic accumulation of phonemes, we find that the phonemes with significant difference can distinguish different regions of the same dialect, which will provide significance for the synthesis and recognition of dialects in the future.
What problem does this paper attempt to address?