Fine-tuning and Utilization Methods of Domain-specific LLMs

Cheonsu Jeong

2024-01-25

Abstract:Recent releases of pre-trained Large Language Models (LLMs) have gained considerable traction, yet research on fine-tuning and employing domain-specific LLMs remains scarce. This study investigates approaches for fine-tuning and leveraging domain-specific LLMs, highlighting trends in LLMs, foundational models, and methods for domain-specific pre-training. Focusing on the financial sector, it details dataset selection, preprocessing, model choice, and considerations crucial for LLM fine-tuning in finance. Addressing the unique characteristics of financial data, the study explores the construction of domain-specific vocabularies and considerations for security and regulatory compliance. In the practical application of LLM fine-tuning, the study outlines the procedure and implementation for generating domain-specific LLMs in finance. Various financial cases, including stock price prediction, sentiment analysis of financial news, automated document processing, research, information extraction, and customer service enhancement, are exemplified. The study explores the potential of LLMs in the financial domain, identifies limitations, and proposes directions for improvement, contributing valuable insights for future research. Ultimately, it advances natural language processing technology in business, suggesting proactive LLM utilization in financial services across industries.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

This paper attempts to address the issue of how to fine-tune and apply large language models (LLMs) in the financial domain. Specifically, the paper focuses on the following points: 1. **Characteristics of Financial Data**: Financial data has unique characteristics, such as rapidly changing market environments, diverse financial events, and large volumes of data. The paper explores how to select, preprocess, and choose models based on these characteristics. 2. **Fine-Tuning Methods**: The research proposes specific methods for fine-tuning domain-specific LLMs in the financial field, including the selection of datasets, preprocessing techniques, and the choice of fine-tuning algorithms. 3. **Security and Compliance**: Considering the special requirements of the financial industry, such as trust, consumer protection regulations, and inclusive finance, the paper also discusses how to ensure the security and compliance of the models. 4. **Practical Application Cases**: Through application examples in various financial scenarios such as stock price prediction, financial news sentiment analysis, automated document processing, information extraction, and customer service enhancement, the paper demonstrates the potential value of LLMs in the financial domain and points out existing limitations and future improvement directions. 5. **Efficiency Improvement**: The ultimate goal is to explore effective fine-tuning methods to improve the performance of language models in financial tasks, thereby enhancing productivity and supporting the decision-making process. In summary, this paper aims to provide the financial industry with a foundational understanding of the core technologies and potential application scenarios of domain-specific LLMs, bringing practical value to financial institutions and research organizations.

Fine-tuning and Utilization Methods of Domain-specific LLMs

A Survey of Large Language Models in Finance (FinLLMs)

A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification

Large Language Models in Finance: A Survey

A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

Empirical Study of LLM Fine-Tuning for Text Classification in Legal Document Review

Data-Centric Financial Large Language Models

Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training

Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations

Evaluating Large Language Models on Financial Report Summarization: An Empirical Study

SNFinLLM: Systematic and Nuanced Financial Domain Adaptation of Chinese Large Language Models

Large Language Model Adaptation for Financial Sentiment Analysis

CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications

InvestLM: A Large Language Model for Investment using Financial Domain Instruction Tuning

Pre-trained Large Language Models for Financial Sentiment Analysis

Mixing It Up: The Cocktail Effect of Multi-Task Fine-Tuning on LLM Performance -- A Case Study in Finance

DEVELOPMENT AND IMPLEMENTATION OF SYSTEMS BASED ON LLM IN FINANCE

Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities

FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models

Fine-Tuning Large Language Models in Education

FinGPT: Democratizing Internet-scale Data for Financial Large Language Models