Abstract:The feature selection in a traditional binary classification algorithm is always used in the stage of dataset preprocessing, which makes the obtained features not necessarily the best ones for the classification algorithm, thus affecting the classification performance. For a traditional rule-based binary classification algorithm, classification rules are usually deterministic, which results in the fuzzy information contained in the rules being ignored. To do so, this paper employs iterative feature selection in fuzzy rule-based binary classification. The proposed algorithm combines feature selection based on fuzzy correlation family with rule mining based on biclustering. It first conducts biclustering on the dataset after feature selection. Then it conducts feature selection again for the biclusters according to the feedback of biclusters evaluation. In this way, an iterative feature selection framework is build. During the iteration process, it stops until the obtained bicluster meets the requirements. In addition, the rule membership function is introduced to extract vectorized fuzzy rules from the bicluster and construct weak classifiers. The weak classifiers with good classification performance are selected by Adaptive Boosting and the strong classifier is constructed by "weighted average". Finally, we perform the proposed algorithm on different datasets and compare it with other peers. Experimental results show that it achieves good classification performance and outperforms its peers.

What problem does this paper attempt to address?

The main problems this paper attempts to address are: 1. **Separation of Feature Selection and Classification Algorithms**: In traditional binary classification algorithms, feature selection is usually performed during the data preprocessing stage. This leads to the selected features not necessarily being the most suitable for the classification algorithm, thereby affecting classification performance. Additionally, the lack of interaction between feature selection and classification algorithms makes the feature selection process often blind, unable to be optimized according to the needs of the classification algorithm. 2. **Limitations of Deterministic Rules**: For rule-based traditional binary classification algorithms, classification rules are usually deterministic, meaning that the combination of features in the rules corresponds to a certain class of samples. However, in actual binary classification problems, the same set of feature values may correspond to different types of samples, i.e., the rules are actually fuzzy. Therefore, extracting deterministic rules may lead to information loss and reduce classification performance. To address these issues, the paper proposes a binary classification algorithm that combines an iterative feature selection framework and fuzzy rules. Specifically, the algorithm is implemented through the following steps: - **Iterative Feature Selection Framework**: First, a feature selection method based on fuzzy relevance families is used to select features from the original dataset. Then, a heuristic biclustering algorithm is used to bicluster the selected feature dataset to mine classification rules. By evaluating the support \( S \) of the biclusters, it is determined whether the biclusters meet the requirements. If not, feature selection based on fuzzy relevance families is performed again, and biclustering search is repeated. This process iterates continuously until the optimal biclusters are obtained. - **Fuzzy Rule Extraction**: Rule membership functions are introduced to extract fuzzy rules from the biclusters. These fuzzy rules are classified according to the membership functions and weak classifiers are constructed. - **Ensemble Learning**: The AdaBoost algorithm is used to test the weak classifiers, select well-performing weak classifiers, and construct a strong classifier through weighted voting. Through the above methods, this paper aims to improve the performance of binary classification tasks, especially when dealing with large-scale, high-dimensional datasets, effectively reducing computational complexity and improving classification accuracy.

Employing Iterative Feature Selection in Fuzzy Rule-Based Binary Classification

A Novel Fuzzy Classification to Enhance Software Regression Testing

TSFNFS: two-stage-fuzzy-neighborhood feature selection with binary whale optimization algorithm

An Improved Feature Selection Algorithm for Ordinal Classification.

Multiclass Cancer Classification by Using Fuzzy Support Vector Machine and Binary Decision Tree with Gene Selection

Understanding the classes better with class-specific and rule-specific feature selection, and redundancy control in a fuzzy rule based framework

FCM_FS: A Simultaneous Clustering and Feature Selection Model for Classification

A novel feature selection approach for biomedical data classification.

A Novel Feature Selection Approach for Biomedical Data Classification

Feature Selection Approach Based on Improved Fuzzy C-Means with Principle of Refined Justifiable Granularity

Feature Selection With Fuzzy-Rough Minimum Classification Error Criterion

An Approach To Feature Selection Based On Fuzzy Clustering And Statistic Theory

EBBA: An Enhanced Binary Bat Algorithm Integrated with Chaos Theory and Levy Flight for Feature Selection

Feature Selection and Rule Generation Integrated Learning for Takagi-Sugeno-Kang Fuzzy System and Its Application in Medical Data Classification

Fuzzy Rough Sets-Based Incremental Feature Selection for Hierarchical Classification

Effective Feature Selection Using Feature Vector Graph for Classification

An incremental approach to hierarchical feature selection by applying fuzzy rough set technique

A New Supervised Feature Selection Method for Pattern Classification.

Cascaded two-stage feature clustering and selection via separability and consistency in fuzzy decision systems

Feature selection using feature ranking, correlation analysis and chaotic binary particle swarm optimization

An Adaptive Neuro-Fuzzy System with Integrated Feature Selection and Rule Extraction for High-Dimensional Classification Problems