Financial Data Analysis Using Expert Bayesian Framework For Bankruptcy Prediction

Amir Mukeri, Habibullah Shaikh, Dr. D.P. Gaikwad
DOI: https://doi.org/10.48550/arXiv.2010.13892
2020-10-30
Abstract: In recent years, bankruptcy forecasting has gained a lot of attention from researchers as well as practitioners in the field of financial risk management. For bankruptcy prediction, various approaches proposed in the past and currently in practice rely on accounting ratios and use statistical modeling or machine learning methods. These models have had varying degrees of success. Models such as Linear Discriminant Analysis or Artificial Neural Networks employ discriminative classification techniques. They lack explicit provision to include prior expert knowledge. In this paper, we propose another route of generative modeling using an Expert Bayesian framework. The biggest advantage of the proposed framework is the explicit inclusion of expert judgment in the modeling process. The proposed methodology also provides a way to quantify uncertainty in prediction. As a result, the model built using the Bayesian framework is highly flexible, interpretable and intuitive in nature. The proposed approach is well suited for highly regulated or safety-critical applications such as finance or medical diagnosis. In such cases, accuracy in the prediction is not the only concern for decision makers. Decision makers and other stakeholders are also interested in the uncertainty in the prediction as well as the interpretability of the model. We empirically demonstrate these benefits of the proposed framework on a real-world dataset using Stan, a probabilistic programming language. We found that the proposed model is either comparable or superior to the other existing methods. The resulting model also has a much lower False Positive Rate than many existing state-of-the-art methods. The corresponding R code for the experiments is available in a GitHub repository.
Statistical Finance, Machine Learning
What problem does this paper attempt to address?
The main problems this paper attempts to solve are **uncertainty in bankruptcy prediction and model interpretability**. Specifically, although existing bankruptcy prediction methods (such as linear discriminant analysis and artificial neural networks) have achieved some success in terms of accuracy, they lack explicit incorporation of expert knowledge and cannot quantify the uncertainty in their predictions. In addition, these models are usually black-box models whose internal mechanisms are difficult to explain, which is a major drawback in fields that require a high degree of interpretability (such as finance and medical diagnosis).

### Main contributions of the paper

1. **Introducing the expert Bayesian framework**: By using the Bayesian framework, the method explicitly incorporates expert judgment into the modeling process, which improves the flexibility, interpretability, and intuitiveness of the model.
2. **Quantifying uncertainty**: The Bayesian method provides not only predictions but also a measure of the uncertainty in those predictions, which is very important for decision makers.
3. **Improving model interpretability**: Unlike traditional machine-learning models, Bayesian models can explain their predictions more intuitively, making it easier for stakeholders to understand how the model works.
4. **Reducing the false-positive rate**: The experimental results show that the proposed Bayesian generalized linear model (GLM) has a much lower false-positive rate than many existing methods while maintaining high accuracy.

### Method overview

- **Dataset**: The study used financial data of Polish companies from 2000-2012, including 64 financial ratios and a label indicating whether each company went bankrupt.
- **Model selection**: Different variable combinations (such as short-term debt ratio and long-term debt ratio) were compared, and the better-performing model (Model #2) was selected.
- **Bayesian inference**: Bayesian inference was carried out using the Stan language, and the posterior distribution was estimated through Markov chain Monte Carlo (MCMC) sampling.
- **Performance evaluation**: The performance of the model was verified through K-fold cross-validation and the confusion matrix on the test set, and it was compared with other traditional methods (such as Z-score, SVM, and ANN).

### Conclusion

This research shows that the Bayesian generalized linear model is not only competitive in prediction accuracy but also performs well in interpretability and uncertainty quantification. In particular, financial ratios related to short-term debt were found to be important indicators for predicting corporate bankruptcy. This method provides new research directions and application possibilities for fields such as credit risk management and investment decision-making.
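
The Bayesian inference step described in the method overview can be illustrated with a minimal sketch. This is not the authors' R/Stan code: it is a plain-Python logistic GLM with Gaussian priors standing in for expert judgment, sampled with a simple random-walk Metropolis algorithm instead of Stan's sampler. The data are synthetic, and every name and number here (`make_toy_data`, `metropolis`, `predictive_summary`, the prior means and standard deviations, the single standardized "short-term debt ratio" feature) is a hypothetical choice made for illustration.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def linear_predictor(beta, x):
    return sum(b * v for b, v in zip(beta, x))


def log_posterior(beta, X, y, prior_mean, prior_sd):
    """Unnormalized log posterior: Gaussian priors plus Bernoulli-logit likelihood."""
    lp = 0.0
    # Prior term: this is where expert judgment enters (assumed Gaussian here).
    for b, m, s in zip(beta, prior_mean, prior_sd):
        lp -= 0.5 * ((b - m) / s) ** 2
    # Likelihood term: log p(y | z) = y*z - log(1 + exp(z)), in a stable form.
    for xi, yi in zip(X, y):
        z = linear_predictor(beta, xi)
        lp += yi * z - (max(z, 0.0) + math.log1p(math.exp(-abs(z))))
    return lp


def metropolis(X, y, prior_mean, prior_sd, n_iter=4000, burn_in=1000,
               step=0.3, seed=0):
    """Random-walk Metropolis sampler (a simple stand-in for Stan's MCMC)."""
    rng = random.Random(seed)
    beta = list(prior_mean)
    current = log_posterior(beta, X, y, prior_mean, prior_sd)
    draws = []
    for i in range(n_iter):
        proposal = [b + rng.gauss(0.0, step) for b in beta]
        candidate = log_posterior(proposal, X, y, prior_mean, prior_sd)
        # Accept with probability min(1, exp(candidate - current)).
        if rng.random() < math.exp(min(0.0, candidate - current)):
            beta, current = proposal, candidate
        if i >= burn_in:
            draws.append(list(beta))
    return draws


def predictive_summary(draws, x, lower=0.05, upper=0.95):
    """Posterior mean and an interval for the bankruptcy probability at x."""
    probs = sorted(sigmoid(linear_predictor(beta, x)) for beta in draws)
    n = len(probs)
    mean = sum(probs) / n
    return mean, probs[int(lower * (n - 1))], probs[int(upper * (n - 1))]


def make_toy_data(n=200, seed=1):
    """Synthetic firms: intercept plus one standardized debt-ratio feature."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        ratio = rng.gauss(0.0, 1.0)
        p = sigmoid(-2.0 + 1.5 * ratio)  # assumed "true" relationship
        X.append([1.0, ratio])
        y.append(1 if rng.random() < p else 0)
    return X, y


X, y = make_toy_data()
# Hypothetical expert prior: higher short-term debt raises bankruptcy risk.
draws = metropolis(X, y, prior_mean=[0.0, 1.0], prior_sd=[2.0, 1.0])
mean_risk, low, high = predictive_summary(draws, [1.0, 2.0])
print(f"P(bankrupt) at ratio=+2 sd: {mean_risk:.2f} (90% interval {low:.2f} to {high:.2f})")
```

The interval returned by `predictive_summary` is the kind of uncertainty statement the summary refers to: rather than a single score, each firm gets a distribution over its bankruptcy probability. A tighter `prior_sd` expresses stronger expert conviction and pulls the posterior toward `prior_mean`, while a wide one lets the data dominate.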