Survey of the loss function in classification models: Comparative study in healthcare and medicine

DOI: https://doi.org/10.1007/s11042-024-19543-8
IF: 2.577
2024-06-06
Multimedia Tools and Applications
Abstract:The selection of an appropriate classification approach depends heavily on the classification rate, which is the most important factor in achieving the desired decision quality. While researchers have examined the impact of different features on the performance of classification approaches, cost/loss functions have received less attention in the comparative literature review, despite their theoretical significance in influencing the classification rate. This paper aims to address this gap by conducting a comparative study on the influence of different cost/loss functions on the classification rate of diverse classifiers. To achieve this objective, the study considers the five most popular and commonly utilized types of cost/loss functions: linear and nonlinear continuous, linear and nonlinear semi-continuous, and discrete cost/loss functions. Furthermore, it takes into consideration the three primary categories of classification approaches: statistical, intelligent, and deep learning classifiers. In addition, a total of 44 benchmark datasets from three distinct domains of medicine, specifically cancer and disease diagnosis, therapy, and biology science, are chosen for analysis. Based on empirical findings, it is evident that the selection of cost/loss functions has a notable impact on the classification rate. The numerical results demonstrate that the discrete cost/loss function performs the best, followed by the semi-continuous and continuous cost/loss functions, in that order. This clearly highlights the positive and direct correlation between aligning the cost/loss function with the goal function of classification approaches and achieving a higher classification rate. Moreover, the average effectiveness of the nonlinear versions of the semi-continuous and continuous cost/loss functions is comparable to that of their linear counterparts. While the choice of cost/loss function can influence the classification rate of various classifiers, the degree of improvement varies depending on the classifier type. In general, statistical classifiers demonstrate a greater degree of enhancement, followed by intelligent classifiers and deep learning models in second and third positions, respectively. Overall, the study reveals a negative correlation between the complexity of classifiers and the improvement in the classification rate when altering the cost/loss function. Furthermore, the numerical findings suggest that the variations in the degree of improvement achieved by changing the cost/loss functions are substantial and affected by the type and domain of the data.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?