Semi-supervised Max-margin Topic Model with Manifold Posterior Regularization.

Wenbo Hu,Jun Zhu,Hang Su,Jingwei Zhuo,Bo Zhang
DOI: https://doi.org/10.24963/ijcai.2017/259
2017-01-01
Abstract:Supervised topic models leverage label information to learn discriminative latent topic representations. As collecting a fully labeled dataset is often time-consuming, semi-supervised learning is of high interest. In this paper, we present an effective semi-supervised max-margin topic model by naturally introducing manifold posterior regularization to a regularized Bayesian topic model, named LapMedLDA. The model jointly learns latent topics and a related classifier with only a small fraction of labeled documents. To perform the approximate inference, we derive an efficient stochastic gradient MCMC method. Unlike the previous semi-supervised topic models, our model adopts a tight coupling between the generative topic model and the discriminative classifier. Extensive experiments demonstrate that such tight coupling brings significant benefits in quantitative and qualitative performance.
What problem does this paper attempt to address?