Towards Trustable Language Models: Investigating Information Quality of Large Language Models

Rick Rejeleene, Xiaowei Xu, John Talburt
2024-01-24
Abstract: Large language models (LLMs) are generating information at a rapid pace, requiring users to increasingly rely on and trust the data. Despite remarkable advances in LLMs, the information they generate is not completely trustworthy because of challenges in information quality. Specifically, the integrity of information quality decreases due to unreliable, biased tokenization during LLM pre-training. Moreover, decreased information quality leads to hallucination and fabricated information. Unreliable information can lead to flawed business decisions, which affects economic activity. In this work, we introduce a novel mathematical evaluation of LLM information quality; we furthermore analyze and highlight information quality challenges and scaling laws for systematically scaling language models.
Computation and Language, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the unreliable quality of information generated by large language models (LLMs), which affects users' trust in these models. Specifically, the paper focuses on the following aspects:

1. **Decline in Information Integrity**: Due to unreliable and biased data during the pre-training process, as well as tokenization issues, the quality of information generated by LLMs decreases.
2. **Hallucinations and False Information**: Due to information quality issues, LLMs may generate fictitious or incorrect information, which can lead to adverse outcomes in business decisions.
3. **Economic Impact**: Unreliable information can affect business decisions, thereby impacting economic activities.

To address these issues, the paper makes the following contributions:

- **Proposes a New Mathematical Formula to Evaluate Information Quality**: Evaluates the quality of information generated by LLMs through three metrics (consistency, relevance, and accuracy) and introduces an evaluation pipeline.
- **Explores the Importance of Trust and Data Quality**: Analyzes how unreliable data leads to a decline in LLM performance and emphasizes the necessity of training on larger datasets to improve data quality.
- **Investigates the Reasons for Limited LLM Quality**: Analyzes the impact of data quality issues on LLM performance and studies how general and specialized datasets can improve performance in specific domains.
- **Explores Scaling Laws**: Examines laws such as Chinchilla and Broken Neural Scaling Laws, which help scale language models systematically.
- **Proposes Research Directions to Improve LLMs**: Including investigating the theories and principles of LLMs, reducing dependence on manual annotation, and improving data quality.

In summary, the paper aims to improve the information quality of, and user trust in, large language models by proposing new evaluation methods and research directions, thereby promoting their widespread application in various fields.
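To make the reference to the Chinchilla scaling law more concrete, the sketch below shows how such a law can turn a training compute budget into a model size and a token count. It is not taken from the paper summarized here. It assumes two widely quoted approximations associated with the Chinchilla work: training compute C ≈ 6·N·D FLOPs for N parameters and D tokens, and a compute-optimal ratio of roughly 20 tokens per parameter. The function name `chinchilla_allocation` and both constants are illustrative choices, and the fitted coefficients in the original scaling-law analysis differ somewhat from these rounded values.

```python
import math
from typing import Tuple

# Assumed rules of thumb (not from the paper summarized above):
FLOPS_PER_PARAM_TOKEN = 6.0  # training compute C is roughly 6 * N * D
TOKENS_PER_PARAM = 20.0      # compute-optimal D is roughly 20 * N


def chinchilla_allocation(compute_flops: float) -> Tuple[float, float]:
    """Split a compute budget into (parameters, tokens) under the assumptions above.

    Substituting D = 20 * N into C = 6 * N * D gives C = 120 * N**2,
    so N = sqrt(C / 120) and D = 20 * N.
    """
    if compute_flops <= 0:
        raise ValueError("compute_flops must be positive")
    n_params = math.sqrt(compute_flops / (FLOPS_PER_PARAM_TOKEN * TOKENS_PER_PARAM))
    n_tokens = TOKENS_PER_PARAM * n_params
    return n_params, n_tokens


if __name__ == "__main__":
    # Hypothetical budget chosen for illustration.
    params, tokens = chinchilla_allocation(5.88e23)
    print(f"{params:.2e} parameters, {tokens:.2e} tokens")
```

With a budget of about 5.88e23 FLOPs, this approximation yields roughly 70 billion parameters and 1.4 trillion tokens, which is in line with the published Chinchilla configuration. Under this rule, quadrupling the compute budget doubles both the suggested model size and the suggested dataset size.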