Gene Markers Identification Algorithm for Detecting Colon Cancer Patients

Juanying XIE,Wen FAN
DOI: https://doi.org/10.16451/j.cnki.issn1003-6059.201711007
2018-01-01
Abstract:To detect those few informative genes with strong classification information and identify colon cancer patients as correctly as possible, an algorithm is proposed in this paper to identify the gene markers for detecting colon cancer patients. The densities and distances are defined for genes firstly. All genes are scattered in a 2D space with gene density and distance as X-axis and Y-axis, respectively. Those genes at high density peaks are selected to construct the optimal gene subset. Then, those samples only with genes in the optimal gene subset of colon dataset are clustered by DP K-medoids clustering algorithm. The distances between genes or samples are calculated via Euclidean distance, Manhattan distance,Chebyshev distance and the cosine distance, respectively. The experimental results demonstrate that the proposed algorithm can find the optimal gene subset of colon cancer with high accuracy, sensitivity, specificity and MCC, and with a very few number of genes as well.
What problem does this paper attempt to address?