GLDA: Parallel Gibbs Sampling for Latent Dirichlet Allocation on GPU.

Pei Xue,Tao Li,Kezhao Zhao,Qiankun Dong,Wenjing Ma
DOI: https://doi.org/10.1007/978-981-10-2209-8_9
2016-01-01
Abstract:With the development of the general computing ability of GPU, more and more algorithms are being run on GPU, to enjoy much higher speed. In this paper, we propose an approach that uniformly accelerate Gibbs Sampling for LDA (Latent Dirichlet Allocation) algorithm on GPU, which makes the data load to the cores of GPU evenly to avoid the idle waiting for GPU, and improves the utilization of GPU. We use three text mining datasets to test the algorithm. Experiments show that our parallel methods can achieve about 30x speedup over sequential training methods with similar prediction precision. Furthermore, the idea that uniformly partitioning the data bases on GPU can also be applied to other machine learning algorithms.
What problem does this paper attempt to address?