Word Length in Political Public Speaking: Distribution and Time Evolution

Natalia L. Tsizhmovska, Leonid M. Martyushev
DOI: https://doi.org/10.3390/e26030180
IF: 2.738
2024-02-21
Entropy
Abstract: In this paper, word length in the texts of public speeches by USA and UK politicians is analyzed. More than 300 speeches delivered over the past two hundred years were studied. It is found that the lognormal distribution better describes the distribution of word length than do the Weibull and Poisson distributions, for example. It is shown that the length of words does not change significantly over time (the average value either does not change or slightly decreases, and the mode slightly increases). These results are fundamentally different from those obtained previously for sentence lengths and indicate that, in terms of quantitative linguistic analysis, the word length in politicians' speech has not evolved over the last 200 years and does not obey the principle of least effort proposed by G. Zipf.
physics, multidisciplinary
What problem does this paper attempt to address?
This paper analyzes word length in public speeches by politicians in the United States and the United Kingdom, drawing on more than 300 speeches delivered over the past two hundred years. Using quantitative linguistic methods, the authors ask whether word length follows the principle of least effort, that is, whether words become shorter over time.

The study finds that a lognormal distribution describes the distribution of word length better than the Weibull and Poisson distributions do. Although previous research suggests that sentence length has decreased over time, word length has not changed significantly: the average either stays constant or decreases slightly, and the mode increases slightly. The corpus draws on speeches from different eras to keep the data representative and consistent.

The absence of a clear temporal trend contradicts the predictions of Zipf's principle of least effort, which posits that language use tends to minimize effort, and of the Menzerath–Altmann law. In summary, at least in the political speeches analyzed, word length has not changed significantly over the past 200 years, which challenges some known linguistic regularities.
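The quantities discussed in this summary (word-length mean and mode, a lognormal fit, and a trend over time) can be illustrated with a minimal Python sketch. It is not the authors' procedure, which this page does not describe. The tokenization rule (runs of ASCII letters), the helper names `word_lengths`, `lognormal_mle`, and `trend_slope`, and the three-sentence toy corpus with its years are all assumptions made for illustration. The Weibull and Poisson comparisons are left out.

```python
import math
import re
from statistics import mean, multimode


def word_lengths(text):
    """Lengths, in letters, of the alphabetic tokens in `text`.

    Assumption: a word is a run of ASCII letters, so "don't" counts as two tokens.
    """
    return [len(w) for w in re.findall(r"[A-Za-z]+", text)]


def lognormal_mle(lengths):
    """Maximum-likelihood (mu, sigma) of a lognormal fitted to `lengths`."""
    logs = [math.log(n) for n in lengths]
    mu = mean(logs)
    sigma = math.sqrt(sum((v - mu) ** 2 for v in logs) / len(logs))
    return mu, sigma


def trend_slope(years, values):
    """Ordinary least-squares slope of `values` against `years`."""
    y_bar, v_bar = mean(years), mean(values)
    num = sum((y - y_bar) * (v - v_bar) for y, v in zip(years, values))
    den = sum((y - y_bar) ** 2 for y in years)
    return num / den


# Hypothetical toy corpus of (year, text) pairs; not data from the paper.
speeches = [
    (1820, "the government of the people shall endure"),
    (1920, "we must work for peace"),
    (2020, "we can build a better future"),
]

per_speech = [(year, word_lengths(text)) for year, text in speeches]
mean_by_year = [(year, mean(lens)) for year, lens in per_speech]
all_lengths = [n for _, lens in per_speech for n in lens]

mu, sigma = lognormal_mle(all_lengths)   # lognormal parameters for the pooled lengths
modes = multimode(all_lengths)           # most frequent word length(s)
slope = trend_slope([y for y, _ in mean_by_year],
                    [m for _, m in mean_by_year])  # change in mean length per year

print(f"mu={mu:.3f} sigma={sigma:.3f} modes={modes} slope={slope:.5f}")
```

On a real corpus, a slope close to zero for the mean word length would correspond to the stability the paper reports. The toy sentences above are far too short to say anything about that.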