Accelerating Topic Detection on Web for a Large-Scale Data Set via Stochastic Poisson Deconvolution.

Jinzhong Lin, Junbiao Pang, Li Su, Yugui Liu, Qingming Huang
DOI: https://doi.org/10.1007/978-3-030-05710-7_49
2019-01-01
Abstract: Organizing webpages into hot topics is one of the key steps in understanding trends in multi-modal web data. To address this problem, Poisson Deconvolution (PD), a state-of-the-art method, was recently proposed to rank the interestingness of web topics on a similarity graph. However, PD optimized by expectation-maximization does not scale efficiently to large data sets. In this paper, we develop Stochastic Poisson Deconvolution (SPD) to handle large-scale web data sets. Experiments on two public data sets and one large-scale synthetic data set demonstrate the efficacy of the proposed approach in comparison with state-of-the-art methods.