Topic evolution based on LDA and HMM and its application in stem cell research

QingQiang Wu, CaiDong Zhang, QingQi Hong, LiYan Chen
DOI: https://doi.org/10.1177/0165551514540565
2014-06-12
Journal of Information Science
Abstract: This paper analyses topic segmentation based on the Latent Dirichlet Allocation (LDA) model and, by combining the Hidden Markov Model (HMM) with co-occurrence theory, performs topic segmentation and topic evolution analysis on the stem cell research literature in PubMed from 2001 to 2012. Stem cell research topics were obtained with LDA, and expert judgements on these topics were used to test the feasibility of the model's classification. Further, the correlation between topics was analysed. HMM was used to predict the evolution trend of topics across years, and a time series map was used to visualize the evolutionary relationships among the stem cell topics.
computer science, information systems; information science & library science