The Research of Bibliographic Information Classification Method Based on the Composite Weighted LDA Model

Xiangdong Li, Cong Ding, Fan Gao
DOI: https://doi.org/10.3772/j.issn.1000-0135.2017.04.004
2017-01-01
Abstract: Automatic classification of bibliographic information is of great significance for the organization of information resources. This paper uses the LDA probabilistic model as the text representation model for bibliographic information, in order to overcome the feature sparsity caused by short texts. Two feature weighting strategies are implemented: one based on the structural style of bibliographic records, and one based on the ability of a feature word to distinguish between categories. The two are combined into a composite weighting strategy, which yields a set of feature words that is not skewed toward high-frequency words and is more representative of the bibliographic categories. By introducing the composite weighting strategy into the LDA model, the paper proposes a bibliographic information classification method based on the composite weighted LDA model. Comparative experiments on public and self-built bibliographic corpora verify and analyze the effectiveness of the composite weighting strategy. The experiments show that the proposed composite weighted LDA method achieves better classification performance than LDA classification methods that consider only one feature weighting strategy.
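The abstract does not give the weighting formulas, so the following is only a minimal sketch of how two feature weights might be combined into a composite weight. The field weights in `FIELD_WEIGHTS`, the discrimination formula log(1 + C / c_t), the multiplicative combination, and all function names are assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter

# Hypothetical field weights (assumption): terms in the title or keywords
# count more than terms in the abstract body.
FIELD_WEIGHTS = {"title": 3.0, "keywords": 2.0, "abstract": 1.0}


def field_weighted_counts(record):
    """Count terms, scaling each occurrence by the weight of its field."""
    counts = Counter()
    for field, text in record.items():
        weight = FIELD_WEIGHTS.get(field, 1.0)
        for term in text.lower().split():
            counts[term] += weight
    return counts


def category_discrimination(records, labels):
    """Assumed discrimination weight: log(1 + C / c_t), where C is the number
    of categories and c_t is the number of categories containing term t."""
    categories_per_term = {}
    for record, label in zip(records, labels):
        for text in record.values():
            for term in text.lower().split():
                categories_per_term.setdefault(term, set()).add(label)
    n_categories = len(set(labels))
    return {
        term: math.log(1 + n_categories / len(cats))
        for term, cats in categories_per_term.items()
    }


def composite_weights(records, labels):
    """Multiply the field-weighted count by the discrimination weight."""
    disc = category_discrimination(records, labels)
    return [
        {term: count * disc[term]
         for term, count in field_weighted_counts(record).items()}
        for record in records
    ]


# Toy records, invented for illustration only.
records = [
    {"title": "topic model text classification",
     "abstract": "a topic model for short text"},
    {"title": "library cataloguing rules",
     "abstract": "rules for short catalogue records"},
]
labels = ["computing", "library science"]
weighted = composite_weights(records, labels)
```

In this sketch a term that appears in every category (such as "short") receives a lower weight than a term confined to one category (such as "topic"), which is one way to keep the feature set from tilting toward high-frequency words. In a full pipeline the weighted values would stand in for raw term counts when fitting the LDA model; how the paper itself integrates the weights into LDA is not stated in the abstract.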