Predicting the Risk Level of a Loan Based on the Customer's Personal Factors Using Machine Learning Models
Jones Yeboah, Isaac Kofi Nti, Jacob Hedrick
DOI: https://doi.org/10.1109/ICMI60790.2024.10586183
2024-04-13
Abstract: Banks are the backbone of the global financial structure, and one of the key ways these organizations generate income is loan interest. When customers default on these loans, an expected gain can become a significant loss for the bank, so it is crucial to estimate the risk of default before granting a loan. Machine learning algorithms can help determine quickly and accurately whether a loan should be granted. In this study, six machine learning models, namely Decision Tree, Random Forest, Support Vector Machine (SVM), Multi-layer Perceptron (MLP) Artificial Neural Network, Naive Bayes, and a stacking ensemble model, were trained to predict the risk level associated with a loan using a dataset containing twenty factors commonly included in a loan application. Among these models, the stacking ensemble model produced the best results with an accuracy of 78.75%, but the Random Tree model was more efficient and produced similar results with an accuracy of 78.15%. We observed that credit amount, checking status, age of the customer, duration of the loan, and purpose of the loan were the most significant predictors of credit risk. The outcome of this study provides further evidence that machine learning models can be valuable tools in the loan approval process.
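The comparison described in the abstract can be illustrated with a minimal sketch, assuming scikit-learn is available. Everything specific here is an assumption rather than the authors' setup: the data is synthetic (generated with `make_classification` to have twenty features), the hyperparameters are arbitrary, and logistic regression as the stacking meta-learner is a common default, not a choice confirmed by the abstract. The accuracies it produces are unrelated to the figures reported above.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Hypothetical stand-in for a loan dataset: 1000 applicants, 20 numeric
# factors, and an imbalanced binary label (1 = high risk). Not the paper's data.
X, y = make_classification(
    n_samples=1000, n_features=20, n_informative=8,
    weights=[0.7, 0.3], random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# The five base learners named in the abstract. SVM and MLP are sensitive to
# feature scale, so they are wrapped in a scaling pipeline (an assumption).
base_models = {
    "decision_tree": DecisionTreeClassifier(max_depth=5, random_state=0),
    "random_forest": RandomForestClassifier(n_estimators=200, random_state=0),
    "svm": make_pipeline(StandardScaler(), SVC(random_state=0)),
    "mlp": make_pipeline(
        StandardScaler(),
        MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=0),
    ),
    "naive_bayes": GaussianNB(),
}

# Stacking: a meta-learner is trained on out-of-fold predictions of the base
# learners. StackingClassifier clones the estimators it is given.
stack = StackingClassifier(
    estimators=list(base_models.items()),
    final_estimator=LogisticRegression(max_iter=1000),  # assumed meta-learner
    cv=5,
)

models = {**base_models, "stacking": stack}
accuracies = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    accuracies[name] = accuracy_score(y_test, model.predict(X_test))

# Rank features by the fitted random forest's impurity-based importances,
# one common (assumed) way to identify the most significant predictors.
importances = models["random_forest"].feature_importances_
top_features = sorted(range(X.shape[1]), key=lambda i: importances[i], reverse=True)[:5]

for name, acc in sorted(accuracies.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:>14}: {acc:.4f}")
print("top feature indices:", top_features)
```

On a real loan application dataset, categorical factors such as checking status or loan purpose would need encoding before these models could be fitted, and the efficiency comparison mentioned in the abstract could be approximated by timing each `fit` call.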
Computer Science, Business