Abstract: Cancer class discovery using biomolecular data is one of the most important tasks for cancer diagnosis and treatment. Tumor clustering from gene expression data provides a new way to perform cancer class discovery. Most existing work adopts single clustering algorithms to perform tumor clustering from biomolecular data, and these algorithms lack robustness, stability, and accuracy. To further improve the performance of tumor clustering from biomolecular data, we introduce fuzzy theory into the cluster ensemble framework and propose four hybrid fuzzy cluster ensemble frameworks (HFCEFs), named HFCEF-I, HFCEF-II, HFCEF-III, and HFCEF-IV, to identify samples that belong to different types of cancer. HFCEF-I and HFCEF-II differ in the ensemble generator they use to produce the set of fuzzy matrices in the ensemble. Specifically, HFCEF-I applies the affinity propagation (AP) algorithm to cluster along the sample dimension and generates a set of fuzzy matrices based on the fuzzy membership function and the base samples selected by AP. HFCEF-II applies AP along the attribute dimension, generates a set of subspaces, and obtains a set of fuzzy matrices by performing fuzzy c-means on the subspaces. HFCEF-III and HFCEF-IV combine the characteristics of HFCEF-I and HFCEF-II: HFCEF-III combines them in a serial way, while HFCEF-IV integrates them in a concurrent way. The HFCEFs adopt suitable consensus functions, such as the fuzzy c-means algorithm or the normalized cut (Ncut) algorithm, to summarize the generated fuzzy matrices and obtain the final results.
Experiments on real data sets from the UCI machine learning repository and on cancer gene expression profiles show that 1) the proposed hybrid fuzzy cluster ensemble frameworks work well on real data sets, especially biomolecular data, and 2) the proposed approaches provide more robust, stable, and accurate results than state-of-the-art single clustering algorithms and traditional cluster ensemble approaches.
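As a rough illustration of the HFCEF-I idea, the sketch below builds one fuzzy matrix from a set of base samples and then stacks several such matrices side by side for a consensus function to process. It is only a sketch under stated assumptions: the abstract does not give the membership function, so the standard fuzzy c-means membership formula is assumed here; the base samples are supplied by hand rather than selected by AP; and the function names `fuzzy_memberships` and `concatenate_matrices` are hypothetical.

```python
import math


def fuzzy_memberships(samples, base_samples, m=2.0):
    """Return one row per sample holding its membership to each base sample.

    Assumption: the standard fuzzy c-means membership formula with
    fuzzifier m > 1 is used; the abstract does not specify the function.
    """
    exponent = 2.0 / (m - 1.0)
    matrix = []
    for x in samples:
        dists = [math.dist(x, b) for b in base_samples]
        if any(d == 0.0 for d in dists):
            # The sample coincides with one or more base samples:
            # share full membership equally among those base samples.
            hits = [1.0 if d == 0.0 else 0.0 for d in dists]
            total = sum(hits)
            matrix.append([h / total for h in hits])
            continue
        row = [
            1.0 / sum((d_j / d_k) ** exponent for d_k in dists)
            for d_j in dists
        ]
        matrix.append(row)
    return matrix


def concatenate_matrices(matrices):
    """Stack fuzzy matrices column-wise, one combined row per sample.

    Assumption: the consensus function (for example fuzzy c-means or Ncut)
    would then be run on these combined rows; that step is not shown.
    """
    n_samples = len(matrices[0])
    return [
        [value for mat in matrices for value in mat[i]]
        for i in range(n_samples)
    ]


# Toy data: three samples on a line, two hand-picked sets of base samples.
samples = [(0.0, 0.0), (1.0, 0.0), (4.0, 0.0)]
u_first = fuzzy_memberships(samples, [(0.0, 0.0), (4.0, 0.0)])
u_second = fuzzy_memberships(samples, [(1.0, 0.0), (4.0, 0.0)])
combined = concatenate_matrices([u_first, u_second])
```

Each row of a fuzzy matrix sums to 1, so a sample near one base sample receives a membership close to 1 for it and small memberships elsewhere. In the frameworks described above, the base samples would come from AP rather than being chosen by hand.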

Class discovery from gene expression data based on perturbation and cluster ensemble.

Knowledge Based Cluster Ensemble for Cancer Discovery from Biomolecular Data

Graph-based Consensus Clustering for Class Discovery from Gene Expression Data

Neural Gas Based Cluster Ensemble Algorithm and Its Application to Cancer Data

An Ensemble Method of Discovering Sample Classes Using Gene Expression Profiling

Penalty-based cluster validity index for class discovery from cancer data.

Hybrid Fuzzy Cluster Ensemble Framework for Tumor Clustering from Biomolecular Data

Finding disagreement pathway signatures and constructing an ensemble model for cancer classification

Class Discovery Based on K-means Clustering and Perturbation Analysis

SC³: Triple Spectral Clustering-Based Consensus Clustering Framework for Class Discovery from Cancer Gene Expression Profiles

Molecular pattern discovery based on penalized matrix decomposition.

An Ensemble Method for Gene Discovery Based on DNA Microarray Data.

Tumor clustering based on hybrid cluster ensemble framework

Selective Ensemble Classification Integrated with Affinity Propagation Clustering

Clustering Analysis Of Microarray Gene Expression Data With New Clustering Ensemble Method

A Method for Cancer Classification Using Ensemble Neural Networks with Gene Expression Profile

A Graph Informed Framework Empowering Gene Pathway Discovery and Gene Expression-Assisted Disease Classification

Computational intelligence approach for gene expression data mining and classification

Inference on differences between classes using cluster-specific contrasts of mixed effects

Biology-constrained gene expression discretization for cancer classification.

Ensemble Classification Based Signature Discovery for Cancer Diagnosis in RNA Expression Profiles Across Different Platforms