Large Language Model Enhanced Machine Learning Estimators for Classification

Yuhang Wu, Yingfei Wang, Chu Wang, Zeyu Zheng

2024-05-09

Abstract: Pre-trained large language models (LLMs) have emerged as a powerful tool for simulating various scenarios and generating output given specific instructions and multimodal input. In this work, we analyze the specific use of LLMs to enhance a classical supervised machine learning method for classification problems. We propose several approaches for integrating an LLM into a classical machine learning estimator to further improve prediction performance. We examine the proposed approaches on both standard supervised binary classification tasks and a transfer learning task in which the test data exhibit a distribution shift relative to the training data. Numerical experiments on four publicly available datasets suggest that using LLMs to enhance classical machine learning estimators can provide significant improvement in prediction performance.

Machine Learning

What problem does this paper attempt to address?

This paper discusses how to use large language models (LLMs) to enhance the performance of traditional machine learning classifiers. It proposes several methods that combine LLMs with classical machine learning approaches to improve prediction accuracy:

1. Linear combination method: The predictions of the LLM and the machine learning (ML) model are combined as a weighted linear sum. When the ML model is uncertain, as it is for data near the decision boundary, more weight is placed on the LLM's prediction.
2. LLM predictions as additional information: The LLM's predictions are used as additional contextual information when calibrating the classical ML model.
3. Transfer learning task: When the target data follow a different distribution from the training data, labels generated by the LLM are used to augment the training data, improving the ML model's performance on the new distribution.

In the experimental section, the paper evaluates these methods on four publicly available datasets covering tasks such as relevance prediction, sentiment recognition, and hate speech detection. It reports that ML models combined with LLMs outperform either the LLM or the ML model used alone. In summary, the paper addresses how to integrate LLMs effectively into ML classifiers, particularly for boundary cases and under distribution shift, and how to use LLM-generated labels in transfer learning to adapt to a new data distribution.
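The linear combination idea can be illustrated with a minimal sketch. This is not the paper's estimator: the function name `combine_predictions`, the parameter `max_llm_weight`, and the specific uncertainty measure (distance of the ML probability from 0.5) are assumptions chosen for illustration. The sketch only shows the general pattern of shifting weight toward the LLM when the ML model's probability is close to the decision boundary.

```python
import numpy as np

def combine_predictions(p_ml, p_llm, max_llm_weight=0.5):
    """Blend ML and LLM positive-class probabilities (illustrative only).

    p_ml, p_llm: array-likes of probabilities in [0, 1].
    max_llm_weight: hypothetical cap on the weight given to the LLM.
    """
    p_ml = np.asarray(p_ml, dtype=float)
    p_llm = np.asarray(p_llm, dtype=float)

    # Assumed uncertainty measure: 1 when p_ml is exactly 0.5,
    # falling linearly to 0 when p_ml is 0 or 1.
    uncertainty = 1.0 - 2.0 * np.abs(p_ml - 0.5)

    # The LLM weight grows with the ML model's uncertainty.
    w_llm = max_llm_weight * uncertainty

    # Weighted linear combination of the two predictions.
    return (1.0 - w_llm) * p_ml + w_llm * p_llm

# Example with made-up probabilities for three inputs.
p_ml = [0.52, 0.95, 0.10]   # ML model is unsure about the first input
p_llm = [0.90, 0.40, 0.20]
blended = combine_predictions(p_ml, p_llm)
labels = (blended >= 0.5).astype(int)
```

In this sketch the first input, where the ML model is nearly undecided, is pulled strongly toward the LLM's prediction, while the confident ML predictions are left almost unchanged. In practice the weighting scheme would be tuned on validation data rather than fixed by hand.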

Improving Clinical Expertise in Large Language Models Using Electronic Medical Records

Revisited Large Language Model for Time Series Analysis through Modality Alignment

Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science

Supervised Knowledge Makes Large Language Models Better In-context Learners

MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications

Adaptable and Reliable Text Classification using Large Language Models

A Survey on Evaluation of Large Language Models

Large Language Models for Education: A Survey and Outlook

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy

Exploring Large Language Models for Feature Selection: A Data-centric Perspective

Large Language Models as Surrogate Models in Evolutionary Algorithms: A Preliminary Study

A Survey of Large Language Models

Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling

Integrating Stock Features and Global Information via Large Language Models for Enhanced Stock Return Prediction

LLM-Select: Feature Selection with Large Language Models

EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model