A Diachronic Study of Chinese Word Length Distribution.

Heng Chen, Haitao Liu
2014-01-01
Glottometrics
Abstract: An investigation of diachronic texts in a single language is an appropriate way to trace the background of the individual parameters of word length distribution models. The present article investigates how word length evolves, based on an analysis of texts from ancient Chinese within a time span of 1000 years. The results show that the parameter a in the Zipf-Alekseev function increases with time. In modern times it is influenced by language policy, which causes it to decrease slightly; however, a predictive estimate of the word length distributions shows that the parameter a does increase with time, which means it is an element of a self-organizing system. A deeper investigation into the historical changes of each word length class, as well as of four statistical indexes of word length distributions, reveals that the increase of multi-syllable words is the main trend in the historical development of word length distributions, which may be correlated with the diachronic increase of parameter a. Moreover, the diachronic synergetic relation between word length and mean word frequency also reveals the increasing use of multi-syllable words in communication, as can be seen from the decrease of the absolute value of the negative parameter b in the function y = ax^b.
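
As an illustration of the power-law relation y = ax^b mentioned in the abstract, the following minimal sketch estimates a and b by ordinary least squares on log-transformed values. It assumes that x stands for word length and y for mean word frequency; the fitting procedure, the function name fit_power_law, and the data points are hypothetical and are not taken from the paper, whose own estimation method is not described here.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**b by least squares in log-log space; return (a, b).

    Hypothetical helper for illustration; requires positive xs and ys.
    """
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(xs)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    # Slope of the regression line in log-log space is the exponent b.
    sxx = sum((u - mean_x) ** 2 for u in log_x)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(log_x, log_y))
    b = sxy / sxx
    # The intercept is ln(a), so exponentiate to recover a.
    a = math.exp(mean_y - b * mean_x)
    return a, b


# Synthetic data generated from a known power law (NOT data from the paper).
lengths = [1, 2, 3, 4]
mean_freq = [120.0 * x ** -1.5 for x in lengths]

a, b = fit_power_law(lengths, mean_freq)
print(f"a = {a:.3f}, b = {b:.3f}")
```

Under this reading, a negative b whose absolute value shrinks across periods would correspond to mean frequency falling off less steeply as word length grows, which matches the abstract's statement about the increasing use of multi-syllable words.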