CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications

Yupeng Cao, Zhiyuan Yao, Zhi Chen, Zhiyang Deng

2024-07-02

Abstract: The integration of Large Language Models (LLMs) into financial analysis has garnered significant attention in the NLP community. This paper presents our solution to the IJCAI-2024 FinLLM challenge, investigating the capabilities of LLMs within three critical areas of financial tasks: financial classification, financial text summarization, and single stock trading. We adopted Llama3-8B and Mistral-7B as base models, fine-tuning them through Parameter Efficient Fine-Tuning (PEFT) and Low-Rank Adaptation (LoRA) approaches. To enhance model performance, we combine datasets from task 1 and task 2 for data fusion. Our approach aims to tackle these diverse tasks in a comprehensive and integrated manner, showcasing LLMs' capacity to address diverse and complex financial tasks with improved accuracy and decision-making capabilities.

Computational Engineering, Finance, and Science; Artificial Intelligence; Machine Learning; Computational Finance

What problem does this paper attempt to address?

The paper addresses three challenges in applying large language models (LLMs) to the financial domain:

1. **Financial Classification**: The paper explores how to fine-tune LLMs to distinguish between claims and premises in financial texts. This distinction is fundamental to understanding the structure of financial narratives and matters for downstream applications such as sentiment analysis, risk assessment, and investment decision-making.
2. **Financial Text Summarization**: The study investigates how to compress lengthy financial documents into concise summaries that retain the key information and insights, so that stakeholders can make quick and effective decisions without reading extensive reports.
3. **Single Stock Trading**: It explores how to use LLMs to analyze financial texts and related data, predict the future price trend of an individual stock, and make trading decisions based on those predictions.

The paper fine-tunes pre-trained LLMs (Llama3-8B and Mistral-7B) with Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning (PEFT) technique. It also combines the datasets from tasks 1 and 2 (data fusion) to improve model performance across these tasks.

Experimental results show that the data fusion strategy significantly improves model performance on the financial classification and text summarization tasks. It does not show a noticeable improvement on the single stock trading task, possibly because that task is more complex and may require more advanced data fusion steps or larger-scale datasets.
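The data fusion step described above can be illustrated with a minimal sketch. This is not the authors' pipeline: the prompt wording, the field names (`text`, `label`, `summary`), the helper names `to_instruction` and `fuse`, and the toy records are all assumptions made for illustration. The sketch only shows the general idea of mapping two task datasets into one shared prompt/response schema and shuffling them into a single fine-tuning set.

```python
import random


def to_instruction(example, task):
    """Map a raw example to a shared prompt/response schema (hypothetical format)."""
    if task == "classification":
        prompt = (
            "Classify the following financial sentence as 'claim' or 'premise'.\n"
            f"Text: {example['text']}"
        )
        response = example["label"]
    elif task == "summarization":
        prompt = f"Summarize the following financial news.\nText: {example['text']}"
        response = example["summary"]
    else:
        raise ValueError(f"unknown task: {task}")
    return {"task": task, "prompt": prompt, "response": response}


def fuse(task1_examples, task2_examples, seed=0):
    """Merge both task datasets into one list and shuffle it deterministically."""
    fused = [to_instruction(e, "classification") for e in task1_examples]
    fused += [to_instruction(e, "summarization") for e in task2_examples]
    random.Random(seed).shuffle(fused)  # interleave tasks so batches mix both
    return fused


# Toy records invented for illustration only; they are not from the challenge data.
task1_toy = [
    {"text": "We expect margins to improve next quarter.", "label": "claim"},
    {"text": "Input costs fell during the period.", "label": "premise"},
]
task2_toy = [
    {"text": "The company reported results and raised guidance.", "summary": "Guidance raised."},
    {"text": "The board approved a share buyback program.", "summary": "Buyback approved."},
]

fused_dataset = fuse(task1_toy, task2_toy)
```

The resulting list could then be passed to whatever fine-tuning setup is in use. Keeping a `task` field on each record makes it easy to evaluate each task separately after training on the fused set.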

Data-Centric Financial Large Language Models

A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification

SNFinLLM: Systematic and Nuanced Financial Domain Adaptation of Chinese Large Language Models

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models

L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization

Large Language Model Adaptation for Financial Sentiment Analysis

Evaluating Large Language Models on Financial Report Summarization: An Empirical Study

FMDLlama: Financial Misinformation Detection based on Large Language Models

Enabling and Analyzing How to Efficiently Extract Information from Hybrid Long Documents with LLMs

NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance

CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models

FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning

Pre-trained Large Language Models for Financial Sentiment Analysis

Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models

Financial News Analytics Using Fine-Tuned Llama 2 GPT Model

Leveraging LLMs for KPIs Retrieval from Hybrid Long-Document: A Comprehensive Framework and Dataset

PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance