Multigranularity Data Analysis with Zentropy Uncertainty Measure for Efficient and Robust Feature Selection

Kehua Yuan,Duoqian Miao,Witold Pedrycz,Hongyun Zhang,Liang Hu
DOI: https://doi.org/10.1109/tcyb.2024.3499952
IF: 11.8
2024-01-01
IEEE Transactions on Cybernetics
Abstract:Multigranularity data analysis has recently become an active research topic in the intelligent computing and data mining fields. Feature selection via multigranularity data analysis is an effective tool for characterizing hierarchical data and enhancing the accuracy of the results. Although the multigranularity data analysis method has been widely adopted for feature selection, existing studies still present one prevalent disadvantage: multigranularity data analysis mostly focuses on information presented at a single granularity while ignoring the hierarchical structure of multigranularity data, which is contrary to the nature of multigranularity. Hence, this article proposes a multigranularity data analysis with a zentropy uncertainty measure for efficient and robust feature selection. Specifically, a consistent degree is first introduced to obtain optimal granularity combinations and establish an efficient neighborhood model for multigranularity information processing. Then, a novel and robust uncertainty measure is developed by integrating the multigranularity information, namely the zentropy-based measure. Considering its accuracy among uncertainty measures, two important measures are further designed and applied to feature selection. Extensive experiments demonstrate that the proposed method can achieve better robustness and classification performance than other state-of-the-art methods.
What problem does this paper attempt to address?