Clustering, Leaf-ordering and Visualization for Intuitive Analysis of Deoxyribonucleic-Acid Chip Data 15
Dennis Muhlestein,SeungJin Lim,Sang-Soo Yeo,Gil-Cheol Park,Sung Kwon Kim,Tai-hoon Kim,Sung Kwon Kim,Haiguang Chen,Gangfeng Gu,Huafeng Wu,Chuanshan Gao
2007-01-01
Abstract:Generally the result data from DNA chip experiments have lots of gene expression information. Scientists want to get perspective insight or want to find intuitive fact from that data. Hierarchical clustering is the most widely used method for analysis of gene expression data. In this paper, we address leaf-ordering, which is a post-processing for the dendrograms - a sort of edge-weighted binary trees - created by hierarchical clustering and we present a new approach for leaf-ordering scheme. And we show the comparison results for our approach and the existing approach. DNA chip experiment technology encourages producing huge sets of gene expression data in these days. DNA chip technology can be applied to functional genomics, genetic network, and association analysis and then it seems to be one of important methods for medical research fields such as genetic disease findings, new medicine development. For accelerating these advantages, it is very important to process and to visualize gene expression data from DNA chip experiments. In other words, a large set of gene expression data could be meaningful information for scientists under the condition that it should be processed by computational methods and be presented by intuitive graphical representation. DNA expression data clustering problem was evolved for this purpose. The key point of the problem is divide dataset into a number of clusters; gathering genes with similar expression pattern together and separating genes with different expression pattern. Clustering enables to predict functions of undiscovered genes by looking their involved clusters and it helps to find a gene set regulating a specific disease by comparing gene expression patterns of health persons and of patients.