Abstract: Heart disease has become a serious threat to human health in recent years. Without timely diagnosis and treatment, patients often suffer disability or even death. Diagnostic accuracy, however, depends directly on the individual doctor's experience, and the many factors associated with heart disease place a heavy burden on doctors, which makes the situation worse. Introducing computer-aided techniques to assist doctors in diagnosis is therefore a feasible way to improve heart disease treatment. At present, researchers usually take the processed dataset (13 features), which doctors selected from the unprocessed dataset (74 features) in the UCI Machine Learning Repository, and apply feature selection to it. This is inappropriate because the feature set is already so small. The value of the unprocessed dataset is neglected, even though it could contain latent information. A comprehensive comparison is needed to demonstrate the advantages of the unprocessed dataset, and the incremental feature combination method also needs to be verified. Because minimum Redundancy Maximum Relevance (mRMR) has been very successful in feature selection, applying it as a feature filter can enhance classification accuracy. In this research, we therefore introduce mRMR as a filter for feature selection and compare it comprehensively, on several metrics, with methods such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Kendall, and Random Forest, as well as with other research works. The results show that, in most cases, the unprocessed dataset improves the algorithms' performance. The incremental feature selection method is effective, and mRMR is superior to the other methods: it achieves the highest accuracies with the fewest supporting features. It reaches 100% accuracy with 8 features on the Cleveland dataset, 98.3% accuracy with 14 features on the Hungarian dataset, and 99% accuracy with 9 features on the Long-beach-VA dataset. Furthermore, we find that some features that doctors regard as useless contribute to classification, which deserves doctors' attention.
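To make the filtering idea concrete, the following is a minimal sketch of the generic greedy mRMR criterion in its difference form (relevance to the label minus mean redundancy with the features already chosen). It is not the paper's exact pipeline: the function names `mutual_information` and `mrmr_rank`, the toy columns `x1`, `x1_dup`, and `x2`, and the use of plain discrete mutual information are all assumptions made for illustration.

```python
from collections import Counter
from math import log


def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x, y) * log(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (c / n) * log(c * n / (px[x] * py[y]))
    return mi


def mrmr_rank(columns, labels, k):
    """Greedily pick up to k feature names by relevance minus mean redundancy."""
    relevance = {name: mutual_information(col, labels)
                 for name, col in columns.items()}
    selected = []
    remaining = list(columns)

    def score(name):
        if not selected:
            return relevance[name]
        redundancy = sum(mutual_information(columns[name], columns[s])
                         for s in selected) / len(selected)
        return relevance[name] - redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=score)  # incremental step: add one feature
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data (not from the UCI heart disease dataset).
labels = [0, 0, 0, 0, 1, 1, 1, 1]
columns = {
    "x1":     [0, 0, 0, 1, 1, 1, 1, 1],  # noisy copy of the label
    "x1_dup": [0, 0, 0, 1, 1, 1, 1, 1],  # exact duplicate of x1 (redundant)
    "x2":     [0, 0, 0, 0, 0, 1, 1, 1],  # noisy copy with a different error
}
chosen = mrmr_rank(columns, labels, k=2)
print(chosen)
```

On this toy input the two selected features are `x1` and `x2`: the exact duplicate is penalized by its redundancy with `x1`, even though its relevance to the label is identical. For continuous features, the mutual information estimate would need discretization or a different estimator, which this sketch does not cover.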

Improved Measures of Redundancy and Relevance for mRMR Feature Selection

The mRMR variable selection method: a comparative study for functional data

A new improved maximal relevance and minimal redundancy method based on feature subset

MVMR-FS: Non-parametric feature selection algorithm based on Maximum inter-class Variation and Minimum Redundancy

Feature Selection with Conditional Mutual Information Considering Feature Interaction

Feature Selection by mRMR Method for Heart Disease Diagnosis

A mRMRMSRC Feature Selection Method for Radiomics Approach

An improved conditional relevance and weighted redundancy feature selection method for gene expression data

A Hybrid Feature-Selection Method Based on mRMR and Binary Differential Evolution for Gene Selection

Maximum Relevance and Minimum Redundancy Feature Selection Methods for a Marketing Machine Learning Platform

A Recognition Method for Diabetic Retinopathy Based on Feature Selection

Stable feature selection using MRMR algorithm

A Feature Selection Method Using Conditional Correlation Dispersion and Redundancy Analysis

SVM-RFE With MRMR Filter for Gene Selection

Discovering the Representative Subset with Low Redundancy for Hyperspectral Feature Selection

A New Method for Redundancy Analysis in Feature Selection

A Novel Feature Selection Method Based on MRMR and Enhanced Flower Pollination Algorithm for High Dimensional Biomedical Data

Fed-mRMR: A lossless federated feature selection method