A hybrid principal label space transformation-based ridge regression and decision tree for multi-label classification
Seyed Hossein Seyed Ebrahimi,Kambiz Majidzadeh,Farhad Soleimanian Gharehchopogh
DOI: https://doi.org/10.1007/s12530-024-09618-0
IF: 2.347
2024-09-25
Evolving Systems
Abstract:In contrast to multi-class classification, in multi-label classification (MLC), each data sample is a member of several classes. In this regard, the classifiers should discover more complex patterns within the data; accordingly, the complexity grows remarkably. Recently, scholars used the binary relevance, the label powerset, and the ensemble method (EM) techniques to extend state-of-the-art classifiers or develop new ones to solve real-world MLC problems more optimally. The current paper recruits the ridge regression (RR) and a decision tree (DT) and presents a new MLC model named PHMLC. In the PHMLC, the Laplacian Eigenmaps Nonlinear Dimensionality Reduction Algorithm is employed to omit the redundant and irrelevant features. Next, the data is encoded by the principal label space transformation (PLST) and singular value decomposition. Then, the RR is recalled to predict the first set of labels through the encoded data. Besides, the Binary Relevance technique is applied to the original DT, and an MLC model called BR-DT is introduced. The BR-DT is responsible for predicting the second set of labels. Eventually, the greedy selection mechanism is utilized to obtain final labels based on the predicted labels by the PLST-based RR and the BR-DT models. To demonstrate the PHMLC's effectiveness, the PHMLC model is applied to nine real-life multi-label datasets, and the results are compared with the BR-SVM, CPLST, BR-Ridge, BR-DR, PLST, and BR-DT models regarding the AP, EM, HS, Macro, and Micro F1 metrics.
computer science, artificial intelligence