Playing with Words: Comparing the Vocabulary and Lexical Richness of ChatGPT and Humans

Pedro Reviriego,Javier Conde,Elena Merino-Gómez,Gonzalo Martínez,José Alberto Hernández
DOI: https://doi.org/10.48550/arXiv.2308.07462
2023-08-31
Abstract:The introduction of Artificial Intelligence (AI) generative language models such as GPT (Generative Pre-trained Transformer) and tools such as ChatGPT has triggered a revolution that can transform how text is generated. This has many implications, for example, as AI-generated text becomes a significant fraction of the text, would this have an effect on the language capabilities of readers and also on the training of newer AI tools? Would it affect the evolution of languages? Focusing on one specific aspect of the language: words; will the use of tools such as ChatGPT increase or reduce the vocabulary used or the lexical richness? This has implications for words, as those not included in AI-generated content will tend to be less and less popular and may eventually be lost. In this work, we perform an initial comparison of the vocabulary and lexical richness of ChatGPT and humans when performing the same tasks. In more detail, two datasets containing the answers to different types of questions answered by ChatGPT and humans, and a third dataset in which ChatGPT paraphrases sentences and questions are used. The analysis shows that ChatGPT tends to use fewer distinct words and lower lexical richness than humans. These results are very preliminary and additional datasets and ChatGPT configurations have to be evaluated to extract more general conclusions. Therefore, further research is needed to understand how the use of ChatGPT and more broadly generative AI tools will affect the vocabulary and lexical richness in different types of text and languages.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to evaluate the differences in vocabulary size and vocabulary richness between artificial intelligence - generated language tools (such as ChatGPT) and humans when performing the same tasks. Specifically, the researchers are concerned with: 1. **Comparison of vocabulary size and vocabulary richness**: By comparing the vocabulary size and vocabulary richness used by ChatGPT and humans when answering different types of questions and restating sentences, explore whether AI tools will lead to a decrease in the usage frequency of certain words and even their gradual disappearance. 2. **Potential impacts**: Analyze the impacts of such differences on language learning, language use, and the training of future AI tools. For example, if AI - generated content occupies most of the text on the Internet, this may form a feedback loop, making new AI tools rely on the content generated by previous AI tools, which may affect the quality of new AI tools. 3. **Methods and datasets**: The researchers used multiple datasets, including TOEFL compositions, medical, computer science, finance, and answers to open - ended questions, as well as restatements of sentences and questions. These datasets respectively contain the texts of ChatGPT and humans for comparative analysis. 4. **Preliminary results**: The study found that ChatGPT uses a smaller vocabulary size and has lower vocabulary richness in most cases. However, these results are preliminary and more data and different versions of ChatGPT are required for further verification. In general, this paper aims to preliminarily explore the differences in vocabulary use between ChatGPT and humans through quantitative analysis and discuss the potential impacts of these differences on future language development and AI technology.