Abstract: Fuzzy rough entropy, established within fuzzy rough set theory, has been applied effectively and efficiently to feature selection for handling uncertainty in real-valued datasets. Fuzzy rough mutual information was later introduced by integrating information entropy with fuzzy rough sets to measure the importance of features. However, no method to date can simultaneously handle the noise, uncertainty, and vagueness that arise from both judgement and identification, and this degrades the overall performance of learning algorithms as the number of mixed-valued conditional features grows. The current study tackles these issues by presenting a novel intuitionistic fuzzy (IF) assisted mutual information concept along with an IF granular structure. First, a hybrid IF similarity relation is introduced. Based on this relation, an IF granular structure is defined. Then, IF rough conditional and joint entropies are established, and mutual information based on these concepts is discussed. Next, mathematical theorems are proved to demonstrate the validity of the given notions. Thereafter, the significance of a feature subset is computed using this mutual information, and a corresponding feature selection method is suggested to delete irrelevant and redundant features. The approach effectively handles noise and subsequent uncertainty in both nominal and mixed data (including both nominal and category variables). Moreover, comprehensive experiments on real-valued benchmark datasets demonstrate the practical validity and effectiveness of the technique. Finally, an application of the proposed method is exhibited to improve the prediction of phospholipidosis-positive molecules. RF(h2o) produces the most effective results to date based on our proposed methodology, with sensitivity, accuracy, specificity, MCC, and AUC of 86.7%, 90.1%, 93.0%, 0.808, and 0.922, respectively.
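The pipeline outlined in the abstract (similarity relation, granules, entropies, mutual information, feature selection) can be illustrated with a minimal sketch. The abstract does not give the hybrid IF similarity relation or the IF entropy definitions, so the code below substitutes an ordinary fuzzy similarity relation and the familiar granule-cardinality form of fuzzy entropy; it is not the authors' method. The function names (`similarity`, `entropy`, `mutual_information`, `greedy_select`), the min-based combination of relations, and the stopping threshold `eps` are all assumptions made for illustration.

```python
import numpy as np

def similarity(column, nominal=False):
    """Pairwise similarity matrix for one feature (assumed form, not the paper's IF relation)."""
    col = np.asarray(column)
    if nominal:
        # Nominal values are similar only when they are equal.
        return (col[:, None] == col[None, :]).astype(float)
    col = col.astype(float)
    span = col.max() - col.min()
    if span == 0:
        # A constant feature cannot distinguish any pair of samples.
        return np.ones((len(col), len(col)))
    scaled = (col - col.min()) / span
    return 1.0 - np.abs(scaled[:, None] - scaled[None, :])

def entropy(rel):
    """Fuzzy entropy from granule cardinalities: -(1/n) * sum_i log2(|[x_i]| / n)."""
    n = rel.shape[0]
    return float(-np.mean(np.log2(rel.sum(axis=1) / n)))

def mutual_information(rel_b, rel_d):
    """I(B; D) = H(B) + H(D) - H(B, D), with the joint relation taken as the elementwise min."""
    joint = np.minimum(rel_b, rel_d)
    return entropy(rel_b) + entropy(rel_d) - entropy(joint)

def greedy_select(X, y, nominal=(), eps=1e-6):
    """Forward selection: add the feature with the highest mutual information
    with the decision until the gain drops to eps or below.
    Nominal columns (indices in `nominal`) are assumed to be integer-coded."""
    X = np.asarray(X, dtype=float)
    n, m = X.shape
    rels = [similarity(X[:, j], nominal=(j in nominal)) for j in range(m)]
    rel_d = similarity(y, nominal=True)
    selected, rel_b, best_mi = [], np.ones((n, n)), 0.0
    while len(selected) < m:
        candidates = [j for j in range(m) if j not in selected]
        scores = {j: mutual_information(np.minimum(rel_b, rels[j]), rel_d)
                  for j in candidates}
        j_best = max(scores, key=scores.get)
        if scores[j_best] - best_mi <= eps:
            break
        selected.append(j_best)
        rel_b = np.minimum(rel_b, rels[j_best])
        best_mi = scores[j_best]
    return selected

# Toy data: column 0 separates the two classes, column 1 is constant.
X_demo = np.array([[0, 5], [0, 5], [0, 5], [1, 5], [1, 5], [1, 5]], dtype=float)
y_demo = np.array([0, 0, 0, 1, 1, 1])
chosen = greedy_select(X_demo, y_demo)
```

On this toy data the constant column adds no information about the decision, so the selection stops after picking the first column. An intuitionistic fuzzy variant would carry both a membership and a non-membership degree for each pair of samples, which this sketch does not model.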

Hybrid similarity relation based mutual information for feature selection in intuitionistic fuzzy rough framework and its applications