COMPUTE PAIRWISE EUCLIDEAN DISTANCES OF DATA POINTS WITH GPUS

Darjen Chang, Nathaniel A. Jones, Dazhuo Li, Ming Ouyang, K. Ragade
2008-01-01
Abstract: Graphics processing units (GPUs) are powerful computational devices tailored toward the needs of the 3-D gaming industry for high-performance, real-time graphics engines. Nvidia released a new generation of GPUs designed for general-purpose computing in 2006, and a GPU programming language called CUDA in 2007. The DNA microarray technology is a high-throughput tool for assaying gene expression of cell cultures or tissue samples. During the exploratory phase of data analysis, scientists often apply (agglomerative) hierarchical clustering on the genes. In hierarchical clustering, a fundamental operation is to calculate all pairwise distances among all genes. If there are n genes, it takes O(n^2) time. In the present study, we examine how to use GPUs and the CUDA language to speed up the calculation. The results achieve a 20 to 44 times speedup on the GPU compared to the CPU implementation.
KEY WORDS: High performance biocomputing, parallel and distributed computation, microarray data analysis, hierarchical clustering
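To make the fundamental operation concrete, the following is a minimal CPU reference sketch in Python of the all-pairs Euclidean distance computation. It is not the authors' CPU or CUDA implementation; the function name `pairwise_euclidean` and the toy data are hypothetical, chosen only for illustration. The sketch shows why the work grows quadratically with the number of genes: there is one distance per pair. Each entry is computed independently of the others, which is the property that makes the problem a natural candidate for parallel hardware.

```python
import math


def pairwise_euclidean(points):
    """Return the n x n matrix of Euclidean distances between rows of `points`.

    `points` is a list of n equal-length lists of floats (one row per gene).
    Illustrative sketch only; the name and interface are assumptions.
    """
    n = len(points)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        # The matrix is symmetric with a zero diagonal, so only the
        # n * (n - 1) / 2 pairs with j > i are computed explicitly.
        for j in range(i + 1, n):
            d = math.sqrt(
                sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
            )
            dist[i][j] = d
            dist[j][i] = d
    return dist


# Hypothetical toy data: 3 "genes", each measured under 2 conditions.
genes = [[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]]
D = pairwise_euclidean(genes)
```

With the toy data above, `D[0][1]` is the distance between the first and second rows. For n genes the double loop visits n(n-1)/2 pairs, and each pair costs time proportional to the number of measurements per gene.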