Leveraging Local Density Decision Labeling and Fuzzy Dependency for Semi-supervised Feature Selection

Gangqiang Zhang, Jingjing Hu, Pengfei Zhang
DOI: https://doi.org/10.1007/s40815-024-01740-0
IF: 4.085
2024-05-27
International Journal of Fuzzy Systems
Abstract: In real-world scenarios, datasets often lack full supervision because decision labels are costly to acquire. Filling in the missing labels completes the dataset and preserves the feature information carried by the unlabeled samples. Furthermore, in the era of big data, datasets tend to be high-dimensional, which complicates subsequent data processing. This study introduces a new semi-supervised feature selection technique. First, a local density decision-labeling algorithm fills in the missing decision labels of the semi-supervised dataset, producing a fully supervised dataset. Next, a fuzzy dependency-based feature selection approach identifies and retains the most relevant features of the completed dataset. Finally, the effectiveness and reliability of the proposed method are validated through a series of experiments.
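The two-stage pipeline in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything here is an assumption and not the authors' algorithm: missing labels are marked with `-1`, local density is a Gaussian-kernel sum, labels are filled by a density-weighted vote of the nearest labeled samples, and fuzzy dependency follows a common fuzzy rough set formulation (min-combined similarity `1 - |a(x) - a(y)|` on features scaled to [0, 1]) with greedy forward selection. The function names `fill_labels`, `fuzzy_dependency`, and `select_features` are hypothetical.

```python
import numpy as np

def fill_labels(X, y, k=3):
    """Fill labels marked -1 (assumed marker) by a density-weighted vote
    of the k nearest labeled samples. Illustrative stand-in only."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y).copy()
    labeled = np.flatnonzero(y != -1)
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    # Assumed local density: Gaussian-kernel sum with a mean-distance cutoff.
    cutoff = dist.mean() + 1e-12
    density = np.exp(-(dist / cutoff) ** 2).sum(axis=1)
    for i in np.flatnonzero(y == -1):
        nearest = labeled[np.argsort(dist[i, labeled])[:k]]
        votes = {}
        for j in nearest:
            # Denser and closer labeled neighbors get a larger say.
            weight = density[j] / (dist[i, j] + 1e-12)
            votes[y[j]] = votes.get(y[j], 0.0) + weight
        y[i] = max(votes, key=votes.get)
    return y

def fuzzy_dependency(X, y, features):
    """Fuzzy rough dependency of the labels on a feature subset.
    Assumes every feature in X is already scaled to [0, 1]."""
    if not features:
        return 0.0
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    Xb = X[:, features]
    # Fuzzy similarity: per-feature 1 - |difference|, combined with min.
    sim = (1.0 - np.abs(Xb[:, None, :] - Xb[None, :, :])).min(axis=2)
    different = y[:, None] != y[None, :]
    # Lower-approximation membership: worst case over other-class samples.
    lower = np.where(different, 1.0 - sim, 1.0).min(axis=1)
    return float(lower.mean())

def select_features(X, y, tol=1e-9):
    """Greedy forward selection: add the feature that raises the
    dependency most, stop when no feature improves it."""
    X = np.asarray(X, dtype=float)
    selected, best = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        scores = [fuzzy_dependency(X, y, selected + [f]) for f in remaining]
        i = int(np.argmax(scores))
        if scores[i] <= best + tol:
            break
        best = scores[i]
        selected.append(remaining.pop(i))
    return selected

# Toy data: feature 0 separates the classes, feature 1 is constant.
X = np.array([[0.0, 0.5], [0.1, 0.5], [0.9, 0.5], [1.0, 0.5]])
y_partial = np.array([0, -1, -1, 1])
y_full = fill_labels(X, y_partial, k=1)
chosen = select_features(X, y_full)
```

On this toy data the uninformative constant feature adds nothing to the dependency, so the greedy loop stops after the first feature. A real implementation would follow the definitions given in the paper itself.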
computer science, information systems, automation & control systems, artificial intelligence