From Text Attribution to Data Extraction: Applications of Big Language Models in Historical Science

A. V. Kuznetsov,
DOI: https://doi.org/10.36683/fp-19/153-158
2024-01-01
Education and Science without Limits Fundamental and Applied Researches
Abstract:The article examines the experience and prospects of using large language models in historical research and humanities. It analyzes the experience of using these models for at-tributing ancient texts, reconstructing damaged manuscripts, studying the evolution of word meanings, and linguistic reconstruction. The tendency to use models for extracting latent characteristics of texts is noted, such as topics and emotional coloring. It also provides an example of creating a specialized software tool – KleioGPT – that allows integrating large language models with thematic corpora of academic sources. Experimental evaluation demonstrated a significant improvement in the quality of answers to history questions and high accuracy in extracting structured data when using KleioGPT in conjunction with the models like ChatGPT.
What problem does this paper attempt to address?