Research on Design of Novelty Retrieval Aided Analysis System Based on LDA Model

Linshan Ma, Lei Guo
DOI: https://doi.org/10.3969/j.issn.1008-0821.2018.02.018
2018-01-01
Abstract: This paper summarizes the principles and computation of Latent Dirichlet Allocation (LDA) and the workflow for analyzing a corpus with the fast collapsed Gibbs sampling algorithm in the open-source R language. It designs the functional framework of a novelty retrieval aided analysis system based on the LDA model and describes its functions, programming approach, and workflow. Finally, using a novelty retrieval case, it explains the basic process of applying the LDA model: mining latent topics from the keywords of the relevant literature, comparing them with the topics of the research content under review, and giving an objective evaluation of the research topic. The results show that the LDA-based system can mine the related literature quickly and effectively, reduce the difficulty novelty consultants face in analyzing the topics of the relevant literature, and improve the objectivity of the topic evaluation. The overall analysis effect was good.
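The collapsed Gibbs sampling step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' R implementation; it is a plain Python illustration of the standard collapsed Gibbs update for LDA, applied to keyword lists. The function names (`lda_gibbs`, `top_words`), the hyperparameter defaults, and the toy keyword lists are all assumptions made for illustration.

```python
import random


def lda_gibbs(docs, num_topics, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Toy collapsed Gibbs sampler for LDA (illustrative, not the paper's code).

    docs: list of keyword lists, one list per document.
    Returns (vocab, doc_topic counts, topic_word counts).
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)

    # Count tables that the sampler keeps in sync with the assignments.
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * V for _ in range(num_topics)]
    topic_total = [0] * num_topics

    # Random initial topic assignment for every token.
    assignments = []
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the current token from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Unnormalized conditional p(z = t | all other assignments).
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + V * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                # Add the token back under its newly sampled topic.
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    return vocab, doc_topic, topic_word


def top_words(vocab, topic_word, n=3):
    """Return the n highest-count keywords for each topic."""
    return [
        [vocab[v] for v in sorted(range(len(vocab)), key=lambda v: -row[v])[:n]]
        for row in topic_word
    ]


if __name__ == "__main__":
    # Hypothetical keyword lists standing in for retrieved literature.
    keyword_docs = [
        ["lda", "topic", "gibbs", "corpus"],
        ["topic", "lda", "keywords", "corpus"],
        ["retrieval", "novelty", "literature", "evaluation"],
        ["novelty", "retrieval", "evaluation", "consultant"],
    ]
    vocab, doc_topic, topic_word = lda_gibbs(keyword_docs, num_topics=2)
    for t, words in enumerate(top_words(vocab, topic_word)):
        print(f"topic {t}: {words}")
```

In a novelty retrieval setting, the per-document topic counts (`doc_topic`) for the relevant literature could then be compared with those of the research content under review; how the paper's system performs that comparison is not specified in the abstract.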