ETF Portfolio Construction via Neural Network trained on Financial Statement Data

Jinho Lee, Sungwoo Park, Jungyu Ahn, Jonghun Kwak
DOI: https://doi.org/10.48550/arXiv.2207.01187
2022-07-04
Abstract: Recently, the application of advanced machine learning methods for asset management has become one of the most intriguing topics. Unfortunately, the application of these methods, such as deep neural networks, is difficult due to the data shortage problem. To address this issue, we propose a novel approach using neural networks to construct a portfolio of exchange traded funds (ETFs) based on the financial statement data of their components. Although a number of ETFs and ETF-managed portfolios have emerged in the past few decades, the ability to apply neural networks to manage ETF portfolios is limited since the number and historical existence of ETFs are relatively smaller and shorter, respectively, than those of individual stocks. Therefore, we use the data of individual stocks to train our neural networks to predict the future performance of individual stocks and use these predictions and the portfolio deposit file (PDF) to construct a portfolio of ETFs. Multiple experiments have been performed, and we have found that our proposed method outperforms the baselines. We believe that our approach can be more beneficial when managing recently listed ETFs, such as thematic ETFs, of which there is relatively limited historical data for training advanced machine learning methods.
Computational Finance, Machine Learning
What problem does this paper attempt to address?
The paper addresses the data shortage that arises when applying advanced machine learning methods, such as deep neural networks, to the construction of ETF (exchange-traded fund) portfolios. Because ETFs are fewer in number and have shorter histories than individual stocks, there is too little ETF-level data to train such models directly. To overcome this, the authors train neural networks on the financial statement data of the individual stocks that make up the ETFs, predict the future performance of those stocks, and then combine these predictions with each ETF's portfolio deposit file (PDF) to construct the ETF portfolio. According to the authors, this approach alleviates the data shortage problem and should be especially beneficial for recently listed ETFs, such as thematic ETFs, whose limited historical data is difficult to use directly for training.
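The step from stock-level predictions to an ETF portfolio can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary above does not specify how predictions are aggregated or how ETFs are selected. The sketch assumes that each PDF can be read as a mapping from constituent ticker to holding weight, that an ETF's score is the weight-averaged prediction of its covered constituents, and that the portfolio holds the top-k ETFs in equal weights. The function names `etf_scores` and `top_k_equal_weight`, the tickers, and all numbers are hypothetical.

```python
from typing import Dict


def etf_scores(
    stock_predictions: Dict[str, float],
    holdings: Dict[str, Dict[str, float]],
) -> Dict[str, float]:
    """Aggregate stock-level predictions into one score per ETF.

    Assumption: each ETF's holdings (standing in for its PDF) map
    ticker -> weight, and the ETF score is the weighted average of the
    predictions for the constituents that have a prediction.
    """
    scores: Dict[str, float] = {}
    for etf, weights in holdings.items():
        # Keep only constituents for which a model prediction exists.
        covered = {t: w for t, w in weights.items() if t in stock_predictions}
        total_weight = sum(covered.values())
        if total_weight <= 0:
            continue  # no usable constituents, so the ETF gets no score
        weighted_sum = sum(w * stock_predictions[t] for t, w in covered.items())
        # Renormalize by covered weight so missing stocks do not bias the score.
        scores[etf] = weighted_sum / total_weight
    return scores


def top_k_equal_weight(scores: Dict[str, float], k: int) -> Dict[str, float]:
    """Hold the k highest-scoring ETFs in equal weights (an assumed rule)."""
    ranked = sorted(scores, key=lambda etf: scores[etf], reverse=True)[:k]
    if not ranked:
        return {}
    return {etf: 1.0 / len(ranked) for etf in ranked}


# Hypothetical predicted returns from a stock-level model.
predictions = {"AAA": 0.04, "BBB": -0.02, "CCC": 0.01}

# Hypothetical holdings; "DDD" has no prediction and is skipped.
pdf_holdings = {
    "ETF_X": {"AAA": 0.6, "BBB": 0.4},
    "ETF_Y": {"BBB": 0.5, "CCC": 0.5},
    "ETF_Z": {"AAA": 0.5, "CCC": 0.3, "DDD": 0.2},
}

scores = etf_scores(predictions, pdf_holdings)
portfolio = top_k_equal_weight(scores, k=2)
print(scores)
print(portfolio)
```

Because the model is trained only on stock-level data, a newly listed ETF can be scored as soon as its holdings are known, which is the property the summary highlights for thematic ETFs.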