MF-DAT: a stock trend prediction of the double-graph attention network based on multisource information fusion

Xiong, Neal
DOI: https://doi.org/10.1007/s00530-024-01333-9
IF: 3.9
2024-04-27
Multimedia Systems
Abstract: Stock forecasting research, which aims to predict the future price movement of stocks, has long been a focus of investors and scholars. It is important for practical applications related to human-centric computing and information sciences. Previous research has generally considered only market information and not the relationships between stocks, and learning a better representation of stock characteristics by taking those relationships into account remains challenging. Most existing methods that combine market information with stock relationship modeling use predefined industry relationships to construct stock relationship graphs, which inevitably ignores the potential interactions between stocks, especially the hidden relationships between stock groups. To this end, a new dual-graph attention model (MF-DAT) based on multisource information fusion is designed. Specifically, multiple features are first fused by the LMF module; then the long-term and short-term state characteristics of stocks are learned through the first graph attention layer; finally, the node representations are updated on a stock relationship network constructed by mining the stock cluster structure through community detection. Our model takes into account both stock time-series information and potential relationships between stocks. Experiments on the S&P 500 and NASDAQ datasets show that MF-DAT performs better than eight popular state-of-the-art (SOTA) methods.
computer science, information systems, theory & methods
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper aims to address the issue of predicting stock price trends. Specifically, the authors propose a novel dual graph attention model (MF-DAT) that leverages multi-source information fusion to achieve stock trend prediction.

#### Main Issues

1. **Limitations of Existing Methods**: Traditional stock prediction methods mainly rely on time series analysis of historical prices, neglecting the relationships and mutual influences between stocks.
2. **Insufficient Stock Relationship Modeling**: Although some studies attempt to combine market information with stock relationship modeling, most methods depend on predefined industry relationships, overlooking potential interactions between stocks.
3. **Improvement in Prediction Performance**: To better describe the characteristics of each stock, it is necessary not only to model market information (such as historical prices, financial news, etc.) but also to model and analyze the potential relationships between stock clusters.

#### Solutions

- **Multi-Feature Fusion (LMF Module)**: The LMF module fuses multiple features to capture the interaction characteristics between different market information.
- **Community Detection to Uncover Stock Relationships**: Based on the correlation of stock return fluctuations, community detection algorithms are used to uncover the cluster structure among stocks, discover new stock relationships, and learn the mutual influences between stocks.
- **Dual Graph Attention Model (MF-DAT)**: A dual graph attention model (MF-DAT) based on multi-source information fusion is proposed. Experiments on the S&P 500 and NASDAQ indices show that this model outperforms other benchmark methods.

### Summary

The main contributions of this paper are:

1. Proposing a novel multi-feature fusion method (LMF module) to capture the interaction characteristics between various information modalities in the stock market.
2. Utilizing community detection algorithms to uncover the cluster structure among stocks, discover new stock relationships, and address the limitations of representing stock relationships with predefined industry relationships.
3. Proposing a dual graph attention model (MF-DAT) based on multi-source information fusion, with experimental results on real datasets demonstrating that this model outperforms other benchmark methods.
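
To make the community-detection idea more concrete, the following is a minimal sketch of how stock clusters might be mined from return correlations. It is not the authors' implementation: the summary does not name the community detection algorithm used, so plain label propagation is swapped in here as a stand-in. The function names, the 0.7 correlation threshold, the tickers, and the return values are all hypothetical and chosen only for illustration.

```python
import math
import random
from collections import Counter


def pearson(x, y):
    """Pearson correlation of two equal-length return series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx > 0 and sy > 0 else 0.0


def build_graph(returns, threshold=0.7):
    """Link two stocks when their return correlation exceeds `threshold`.

    The threshold is an assumed value, not one taken from the paper.
    """
    tickers = sorted(returns)
    graph = {t: set() for t in tickers}
    for i, a in enumerate(tickers):
        for b in tickers[i + 1:]:
            if pearson(returns[a], returns[b]) > threshold:
                graph[a].add(b)
                graph[b].add(a)
    return graph


def label_propagation(graph, max_iters=100, seed=0):
    """Asynchronous label propagation; returns {node: community label}."""
    rng = random.Random(seed)
    labels = {node: node for node in graph}  # every node starts alone
    nodes = sorted(graph)
    for _ in range(max_iters):
        rng.shuffle(nodes)
        changed = False
        for node in nodes:
            if not graph[node]:
                continue  # isolated stocks keep their own label
            counts = Counter(labels[nb] for nb in graph[node])
            best = max(counts.values())
            # Break ties deterministically by taking the smallest label.
            new_label = min(lab for lab, c in counts.items() if c == best)
            if new_label != labels[node]:
                labels[node] = new_label
                changed = True
        if not changed:
            break
    return labels


# Made-up daily returns: AAA/BBB/CCC move together, DDD/EEE move together.
returns = {
    "AAA": [0.010, -0.020, 0.015, 0.030, -0.010, 0.005],
    "BBB": [0.020, -0.040, 0.030, 0.060, -0.020, 0.010],
    "CCC": [0.011, -0.019, 0.014, 0.031, -0.011, 0.006],
    "DDD": [-0.020, 0.010, 0.020, -0.015, 0.005, 0.010],
    "EEE": [-0.010, 0.005, 0.010, -0.0075, 0.0025, 0.005],
}
labels = label_propagation(build_graph(returns))
```

In a pipeline like the one summarized above, stocks that share a label would be connected in the relationship graph that a graph attention layer then operates on. In practice, a library routine for modularity-based or label-propagation community detection would likely replace the hand-written loop.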