Financial Sentiment Analysis on News and Reports Using Large Language Models and FinBERT

Yanxin Shen, Pulin Kirin Zhang
2024-10-03
Abstract: Financial sentiment analysis (FSA) is crucial for evaluating market sentiment and making well-informed financial decisions. The advent of large language models (LLMs) such as BERT and its financial variant, FinBERT, has notably enhanced sentiment analysis capabilities. This paper investigates the application of LLMs and FinBERT for FSA, comparing their performance on news articles, financial reports, and company announcements. The study emphasizes the advantages of prompt engineering with zero-shot and few-shot strategies to improve sentiment classification accuracy. Experimental results indicate that GPT-4o, with few-shot examples of financial texts, can be as competent as a well fine-tuned FinBERT in this specialized field.
Information Retrieval, Computation and Language, Social and Information Networks, General Finance
What problem does this paper attempt to address?
The paper explores how large language models (LLMs) and FinBERT, a BERT variant tailored to the financial domain, perform in financial sentiment analysis (FSA). The researchers compare these models on news articles, financial reports, and company announcements, focusing on how zero-shot and few-shot prompt engineering strategies affect sentiment classification accuracy. The main objectives of the paper include:

1. **Evaluating the performance of LLMs and FinBERT in financial sentiment analysis**: Assessing the accuracy and effectiveness of these models on financial texts under different configurations (zero-shot, few-shot, and fine-tuning).
2. **Exploring the effectiveness of prompt engineering**: Investigating how to design prompts that help LLMs better understand and classify sentiment in financial texts.
3. **Comparing the performance of general models and domain-specific models**: In particular, comparing a general language model such as GPT-4o with the specially trained FinBERT to understand their relative advantages in financial sentiment analysis tasks.

The experimental results indicate that although GPT-4o performs well in few-shot settings, approaching the performance of a well fine-tuned FinBERT, the latter still demonstrates higher accuracy and robustness in financial sentiment analysis due to its specialized domain pre-training. This highlights the importance of domain-specific models in handling specialized domain data.
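
To make the difference between the zero-shot and few-shot prompting strategies concrete, here is a minimal Python sketch of how such prompts might be assembled. It is not the authors' prompt or code: the function names `build_prompt` and `parse_label`, the prompt wording, and the three-way label set (positive, neutral, negative) are all assumptions made for illustration, and no model is called.

```python
from typing import Optional, Sequence, Tuple

# Assumed label set; the source does not state which labels were used.
LABELS = ("positive", "neutral", "negative")


def build_prompt(
    text: str,
    examples: Optional[Sequence[Tuple[str, str]]] = None,
) -> str:
    """Return a zero-shot prompt, or a few-shot prompt if examples are given.

    Hypothetical helper: the instruction wording is illustrative only.
    """
    parts = [
        "Classify the sentiment of the financial text as one of: "
        + ", ".join(LABELS)
        + "."
    ]
    # Few-shot: prepend labeled demonstrations before the query text.
    for example_text, label in examples or ():
        if label not in LABELS:
            raise ValueError(f"unknown label: {label!r}")
        parts.append(f"Text: {example_text}\nSentiment: {label}")
    # The query is left open-ended so the model completes the label.
    parts.append(f"Text: {text}\nSentiment:")
    return "\n\n".join(parts)


def parse_label(reply: str) -> Optional[str]:
    """Map a free-form model reply to one of LABELS, or None if unclear."""
    cleaned = reply.strip().lower()
    for label in LABELS:
        if cleaned.startswith(label):
            return label
    return None
```

Calling `build_prompt(text)` with no examples gives the zero-shot variant. Passing a short list of `(text, label)` pairs gives the few-shot variant, where the demonstrations show the model the expected output format and the domain's vocabulary before it sees the query.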