Typological Features of Zhuang from the Perspective of Word Frequency Distribution.

Aiyun Wei, Haitao Liu
2019-01-01
Glottometrics
Abstract: Investigating lexical features with statistical methods has long been a central concern of quantitative linguistics. However, although Zhuang is the mother tongue of the most populous ethnic minority in China, its lexical features have attracted little attention from researchers using quantitative methods. Based on a corpus (CZL) of over 500,000 tokens of the Zhuang language, this study examines the word frequency distribution of Zhuang. The results show that Zhuang shares a universal feature of other tested languages: its word frequency distribution abides by Zipf's law and the "Least Effort Principle". The study also tests the word frequency distribution of Zhuang texts of different genres and finds that the values of some parameters, such as b, differ across genres. Moreover, to test whether Zhuang has any distinctive or typological features in its word frequency distribution, the values of the h-point and a-index of the texts in CZL are computed as well. The two indices prove effective in distinguishing Zhuang from other languages, and the position of Zhuang on the analytism-synthetism continuum proposed by Popescu is close to the positions of languages in the Polynesian language family, which may help with the intersubjective placement of Zhuang into a language group. This study opens a new perspective on the statistical lexical research of Zhuang and presents a "new" corroborated language with respect to the laws of quantitative linguistics.
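The abstract does not spell out how the h-point and a-index are obtained, so the following is only a minimal sketch. It assumes the definitions commonly used in Popescu and Altmann's work: the h-point is the rank r at which the rank equals the frequency f(r), linearly interpolated when no rank matches exactly, and the a-index is a = N / h², where N is the text length in tokens. The function names and the toy token list are hypothetical and are not taken from the paper or from CZL.

```python
from collections import Counter


def rank_frequencies(tokens):
    """Return word frequencies in descending order (index 0 is rank 1)."""
    return sorted(Counter(tokens).values(), reverse=True)


def h_point(freqs):
    """Rank at which rank == frequency (assumed Popescu-style definition).

    If no rank matches exactly, interpolate linearly between the last
    rank with f > r and the first rank with f < r.
    """
    if not freqs:
        raise ValueError("empty frequency list")
    for i, f in enumerate(freqs):
        r = i + 1
        if f == r:
            return float(r)
        if f < r:
            # i >= 1 here, because f >= 1 means f < r cannot hold at rank 1.
            r1, f1 = r - 1, freqs[i - 1]
            # Intersection of the segment (r1, f1)-(r, f) with the line f = r.
            return (f1 * r - f * r1) / (r - r1 + f1 - f)
    # Assumption for the degenerate case where every frequency exceeds its
    # rank: fall back to the number of distinct word forms.
    return float(len(freqs))


def a_index(freqs):
    """a = N / h^2, with N the total number of tokens."""
    h = h_point(freqs)
    return sum(freqs) / (h * h)


# Toy data for illustration only (not from the Zhuang corpus).
tokens = ["a"] * 5 + ["b"] * 4 + ["c"] * 3 + ["d"] * 2 + ["e"]
freqs = rank_frequencies(tokens)
print(h_point(freqs), a_index(freqs))
```

Because the a-index divides text length by h², it is meant to be less sensitive to text length than the h-point alone, which is one reason such indices are used to compare texts and languages.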