A New Representation Learning Approach for Credit Data Analysis

Tie Li,Gang Kou,Yi Peng
DOI: https://doi.org/10.1016/j.ins.2023.01.068
IF: 8.1
2023-01-01
Information Sciences
Abstract:Representation learning has an important impact on the performance of machine learning methods and has been used to solve many distribution problems for numerous graphical and sequential mining tasks. While the distributions of credit data are very complex, the represen-tations of such data are less studied. This study proposes a new representation learning approach based on a neural network called Nystro center dot mNet, which represents the credit data to benefit credit evaluation and sub-pattern analysis. The Nystro center dot mNet is developed to utilize the advantages of the Nystro center dot m method - a kernel approximation method in credit evaluation, yet overcomes its two limitations: distance distortions in kernel functions, and parameter tuning. The two main modules contained in Nystro center dot mNet, i.e., the Distance Metric Learning module and the Nystro center dot m module, can benefit each other and yield an overall optimum. Experiments using six real-life large-scale credit data showed that the AUC of the distance-based classifiers and the linear classifiers were improved by 2-11% and 2-14% with the newly generated distributions. The proposed approach also has certain practical advantages over traditional approaches because it is free from complex parameter tuning, consumes fewer memories, and is easy to utilize automatic differential frameworks such as PyTorch. The proposed approach is highly suitable for large-scale credit evaluation.
What problem does this paper attempt to address?