Understanding Literary Texts by LLMs: A Case Study of Ancient Chinese Poetry

Cheng Zhao, Bin Wang, Zhen Wang

2024-09-11

Abstract: The birth and rapid development of large language models (LLMs) have caused quite a stir in the field of literature. Once considered unattainable, AI's role in literary creation is increasingly becoming a reality. In genres such as poetry, jokes, and short stories, numerous AI tools have emerged, offering refreshing new perspectives. However, it is difficult to further improve the quality of these works. This is primarily because understanding and appreciating a good literary work involves a considerable threshold, such as knowledge of literary theory, aesthetic sensibility, and interdisciplinary knowledge. As a result, authoritative data in this area is scarce. Additionally, evaluating literary works is often complex and hard to fully quantify, which directly hinders the further development of AI creation. To address this issue, this paper explores literary texts from the perspective of LLMs, using ancient Chinese poetry as an example for experimentation. First, we collected a variety of ancient poems from different sources and had experts annotate a small portion of them. Then, we designed a range of comprehension metrics based on LLMs to evaluate all these poems. Finally, we analyzed the correlations and differences between various poem collections to identify literary patterns. Through our experiments, we observed a series of enlightening phenomena that provide technical support for the future development of high-level literary creation based on LLMs.

Computation and Language

What problem does this paper attempt to address?

The main problem this paper attempts to address is the understanding and evaluation of classical literary works, particularly ancient Chinese poetry, through large language models (LLMs). Specifically, the paper aims to:

1. **Address the challenge of evaluating high-quality literary creation**: The current evaluation of AI-generated literary works relies primarily on human assessment. Once the quality of a work reaches a certain level, non-expert evaluations become unreliable, while expert evaluations are difficult to scale. The paper therefore proposes an LLM-based framework for literary understanding to improve the efficiency and accuracy of quality evaluation.
2. **Explore methods for understanding literary texts**: Ancient poems are collected from different sources, experts annotate a portion of them, and a series of LLM-based understanding metrics is designed to evaluate the poems. These metrics include, but are not limited to, lexical statistics, embedding vectors, hidden states, and output probabilities.
3. **Identify literary patterns**: By analyzing the metrics of different poetry collections, the paper identifies patterns and differences in literary works, thereby providing technical support for future advanced literary creation.
4. **Quantify the evaluation of literary works**: By using only a small amount of expert-annotated data to tackle the difficulty of evaluating literary texts, the paper demonstrates that the method is highly scalable. It argues that, with the support of LLMs, the evaluation of more types of literary works can be effectively quantified, which would promote the development of AI literary creation.

In summary, through a series of experiments and technical means, this paper explores how to better understand and evaluate ancient Chinese poetry using LLMs, providing new ideas and technical support for the high-quality development of future literary creation.
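To make two of the metric families named above more concrete, here is a minimal Python sketch of a lexical statistic and an output-probability metric. It is an illustration only, not the paper's implementation: the function names `lexical_stats` and `perplexity`, the choice of type-token ratio and character entropy as lexical statistics, and the use of perplexity as the probability-based metric are all assumptions. The per-token log-probabilities are assumed to come from whichever LLM is being used; no model is loaded here.

```python
import math
from collections import Counter
from typing import Dict, Sequence


def lexical_stats(poem: str) -> Dict[str, float]:
    """Character-level statistics for a poem (hypothetical helper).

    Punctuation and whitespace are ignored; each remaining character
    is treated as one token, which suits classical Chinese verse.
    """
    chars = [c for c in poem if c.isalnum()]
    if not chars:
        raise ValueError("poem contains no countable characters")
    counts = Counter(chars)
    total = len(chars)
    # Shannon entropy of the character distribution, in bits.
    entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
    return {
        "length": total,
        "type_token_ratio": len(counts) / total,
        "char_entropy_bits": entropy,
    }


def perplexity(token_logprobs: Sequence[float]) -> float:
    """Perplexity from per-token natural-log probabilities (hypothetical helper).

    The log-probabilities are assumed to be supplied by an LLM that
    scored the poem token by token.
    """
    if not token_logprobs:
        raise ValueError("token_logprobs must not be empty")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


if __name__ == "__main__":
    poem = "床前明月光，疑是地上霜。举头望明月，低头思故乡。"
    print(lexical_stats(poem))
    # Placeholder log-probabilities; real values would come from a model.
    print(perplexity([math.log(0.5)] * 4))
```

Metrics like these are cheap to compute for every poem in a collection, so they could be compared across collections or against the small expert-annotated subset; which specific statistics the paper actually uses is not stated in this summary.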
