A Quantitative Linguistic Study on the Relationship between Word Length and Word Frequency

邓耀臣,冯志伟
2013-01-01
Abstract:The present paper reports on a corpus-based quantitative linguistic study on the relationship between word length and word frequency in Chinese within the theoretical framework of Synergetic Linguistics.Results show a high dependency of word frequency on word length.The longer a word is,the less frequently it is used in discourse,which reflects an inverse relation between these two properties.The power model y = axb is proved to fit best the data and capture this regularity.The results further indicate that the parameter of this model,a,is powerful in distinguishing the texts of different styles.The results of the study not only complement the current theories on the relationship between word length and word frequency,providing new evidence for the relationship as a linguistic universal,but also offer a new paradigm for style identification and text classification.
What problem does this paper attempt to address?