Mixture Model-based Text Clustering: A Review

Wang Fang, Cheng Ying, Ke Qing
DOI: https://doi.org/10.3772/j.issn.1000-0135.2015.005.010
2015-01-01
Abstract: Model-based clustering has attracted growing attention, and empirical studies have shown that it offers distinct advantages. This paper reviews the current state of document clustering based on mixture models. Following the technical route of such methods, it summarizes three main parts: document modeling, parameter modeling, and model inference. It also analyzes problems common to different studies, including feature reduction, semi-supervised clustering, and the integration of the clustering process. Finally, it presents possible future research directions in this field.