Using General Large Language Models to Classify Mathematical Documents

Patrick D.F. Ion, Stephen M. Watt

2024-06-12

Abstract: In this article we report on an initial exploration to assess the viability of using the recently released general large language models (LLMs) to classify mathematical documents. Automated classification would be useful both for the applied purpose of improving navigation of the literature and for the more open-ended goal of identifying relations among mathematical results. The Mathematics Subject Classification (MSC 2020), from MathSciNet and zbMATH, is widely used, and there is a significant corpus of ground truth material in the open literature. We have evaluated the classification of preprint articles from arXiv.org according to MSC 2020. The experiment used only the title and abstract, not the entire paper. Since this was early in the use of chatbots and the development of their APIs, we report here on what was carried out by hand. Of course, automation of the process will have to follow if it is to be generally useful. We found that in about 60% of our sample the LLM produced a primary classification matching one already reported on arXiv. In about half of those instances, there were additional primary classifications that were not detected. In about 40% of our sample, the LLM suggested a classification different from the one provided. A detailed examination of these cases, however, showed that the LLM-suggested classifications were in most cases better than those provided.

Information Retrieval, Computation and Language, Digital Libraries

What problem does this paper attempt to address?

This paper examines the feasibility of using Large Language Models (LLMs) to classify mathematical documents. The researchers used tools like ChatGPT to generate classification suggestions from paper titles and abstracts, and compared them with the Mathematics Subject Classification (MSC 2020) codes listed on arXiv. In approximately 60% of the sample, the primary classification proposed by the LLM matched one listed on arXiv; in the remaining roughly 40%, the LLM proposed a different classification. A detailed examination of the mismatches indicated that in most of those cases the LLM's suggestion was better than the classification provided with the preprint. The paper presents this as evidence of the potential of LLMs for automated classification, particularly for improving literature navigation and identifying relationships between mathematical results. Because the experiment was carried out by hand, automating the process is a necessary next step; future work may also include improving the accuracy and reliability of the classifications.
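The comparison described above can be illustrated with a short sketch. It is not the authors' procedure (they worked by hand), and it assumes that "matching" means agreement on the two-digit top-level MSC area, which the text here does not specify. The function names and the sample codes are hypothetical and serve only to show the bookkeeping.

```python
import re

# Assumed shape of an MSC-style code: "11", "11F03", "11Fxx", "11-XX", "11-01".
MSC_CODE = re.compile(r"^(\d{2})(?:[A-Z-](?:\d{2}|[xX]{2}))?$")


def top_level(code: str) -> str:
    """Return the two-digit top-level area of an MSC-style code."""
    m = MSC_CODE.match(code.strip())
    if m is None:
        raise ValueError(f"not an MSC-style code: {code!r}")
    return m.group(1)


def agrees(predicted_primary: str, listed_codes: list[str]) -> bool:
    """True if the predicted primary code shares a top-level area
    with any code listed for the preprint (assumed matching rule)."""
    listed_areas = {top_level(c) for c in listed_codes}
    return top_level(predicted_primary) in listed_areas


def match_rate(samples: list[tuple[str, list[str]]]) -> float:
    """Fraction of (predicted, listed) pairs that agree."""
    if not samples:
        raise ValueError("no samples")
    hits = sum(agrees(pred, listed) for pred, listed in samples)
    return hits / len(samples)


# Made-up illustrative pairs, not data from the paper.
example = [
    ("11F03", ["11F11", "14G35"]),  # same top-level area 11
    ("68T50", ["03B35"]),           # different areas
]
rate = match_rate(example)
```

A stricter rule, such as agreement on the full five-character code, could be substituted in `agrees` without changing the rest of the sketch.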

AutoMSC: Automatic Assignment of Mathematics Subject Classification Labels

Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange

Large Language Models for Mathematicians

Mathematical Language Models: A Survey

Large Language Models for Mathematical Reasoning: Progresses and Challenges

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Can Large Language Models Serve as Effective Classifiers for Hierarchical Multi-Label Classification of Scientific Documents at Industrial Scale?

MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit

math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories

Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering

Towards a Mathematics Formalisation Assistant using Large Language Models

Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From Cognitive Psychology

Transforming Scholarly Landscapes: Influence of Large Language Models on Academic Fields beyond Computer Science

Document-Level Machine Translation with Large Language Models

An Interdisciplinary Outlook on Large Language Models for Scientific Research

Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching

Evaluating Language Models for Mathematics through Interactions

The emergence of Large Language Models (LLM) as a tool in literature reviews: an LLM automated systematic review