Abstract: This study examines the use of several machine learning models to predict financial distress in Indonesian companies, using a financial ratio dataset collected from the Indonesia Stock Exchange (IDX) that covers financial indicators for companies across multiple industries over a decade. The data were partitioned into training and test sets, and class imbalance was addressed with the SMOTE and RUS resampling approaches so that model training and evaluation would be reliable and unbiased. Initial models were built to establish a performance baseline. The models assessed were Decision Trees, XGBoost, Random Forest, LSTM, and Support Vector Machine (SVM). The ensemble models, XGBoost and Random Forest, performed better when combined with SMOTE. These findings support the effectiveness of ensemble methods for forecasting financial distress; specifically, XGBClassifier and RandomForestClassifier showed reliable and robust performance. Feature importance analysis highlighted the significance of financial indicators: interest_coverage and operating_margin, for instance, were crucial to the models' predictive ability. Both companies and regulators can use these findings: the XGBClassifier and RandomForestClassifier could be employed to forecast financial distress, and the interest coverage ratio and operating margin ratio deserve particular attention, as these financial ratios play a critical role in assessing company performance.
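The two resampling approaches named in the abstract, SMOTE and RUS, can be illustrated with a minimal standard-library sketch. It is not the study's implementation: the function names `smote` and `random_undersample`, the neighbour count `k=3`, and the toy rows of (interest_coverage, operating_margin) values are all assumptions made for illustration, and the sketch assumes the minority class has at least two rows.

```python
import math
import random


def random_undersample(majority, n_keep, rng):
    """RUS: keep a random subset of majority-class rows, without replacement."""
    return rng.sample(majority, n_keep)


def smote(minority, n_new, k, rng):
    """SMOTE-style oversampling: create each synthetic row by interpolating
    between a minority row and one of its k nearest minority neighbours."""
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of `base`, excluding `base` itself
        neighbours = sorted(
            (row for row in minority if row is not base),
            key=lambda row: math.dist(base, row),
        )[:k]
        mate = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from base to mate
        synthetic.append(tuple(b + gap * (m - b) for b, m in zip(base, mate)))
    return synthetic


rng = random.Random(0)

# Made-up (interest_coverage, operating_margin) rows, for illustration only.
healthy = [(rng.uniform(2.0, 8.0), rng.uniform(0.05, 0.30)) for _ in range(20)]
distressed = [(0.4, -0.10), (0.9, -0.02), (0.6, -0.07), (1.1, 0.01)]

# Option 1: oversample the minority class up to the majority size.
new_rows = smote(distressed, len(healthy) - len(distressed), k=3, rng=rng)
distressed_smote = distressed + new_rows

# Option 2: undersample the majority class down to the minority size.
healthy_rus = random_undersample(healthy, len(distressed), rng)
```

In a workflow like the one described, resampling of this kind would normally be applied to the training partition only, so that the test set keeps its original class ratio.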

Financial Distress Prediction Using a Corrected Feature Selection Measure and Gradient Boosted Decision Tree

Multi-class Financial Distress Prediction Based on Feature Selection and Deep Forest Algorithm

Financial distress prediction using an improved particle swarm optimization wrapper feature selection method and tree boosting ensemble

Interpreting the prediction results of the tree‐based gradient boosting models for financial distress prediction with an explainable machine learning approach

Corporate Financial Distress Prediction: Based on Multi-source Data and Feature Selection

Financial distress prediction based on ensemble feature selection and improved stacking algorithm

Improving financial distress prediction using machine learning: A preliminary study

Advancing financial analytics: Integrating XGBoost, LSTM, and Random Forest Algorithms for precision forecasting of corporate financial distress

Class‐imbalanced financial distress prediction with machine learning: Incorporating financial, management, textual, and social responsibility features into index system

Financial Distress Prediction with Optimal Decision Trees Based on the Optimal Sampling Probability

Improving Financial Distress Prediction Using Financial Network-Based Information and GA-Based Gradient Boosting Method

Cost-sensitive AdaBoost Selective Ensemble for Financial Distress Prediction

Incorporating Multiple Textual Factors into Unbalanced Financial Distress Prediction: A Feature Selection Methods and Ensemble Classifiers Combined Approach

CUS-heterogeneous ensemble-based financial distress prediction for imbalanced dataset with ensemble feature selection

Dynamic forecasting of financial distress: the hybrid use of incremental bagging and genetic algorithm—empirical study of Chinese listed corporations

Improving financial distress prediction using textual sentiment of annual reports

Novel feature selection methods to financial distress prediction

Use of Hybrid Fuzzy c-means and Probabilistic Neural Network Based on Improved Particle Swarm Optimization in the Prediction of Financial Distress

Corporate distress prediction in China: a machine learning approach

The Prediction for Listed Companies' Financial Distress by Using Multiple Prediction Methods with Rough Set and Dempster-Shafer Evidence Theory

A Gradient-Boosting Decision-Tree Approach for Firm Failure Prediction: an Empirical Model Evaluation of Chinese Listed Companies