Utterance-Level Latent Topic Transition Modeling for Spoken Documents and Its Application in Automatic Summarization

Hung-yi Lee, Yun-nung Chen, Lin-shan Lee
DOI: https://doi.org/10.1109/icassp.2012.6289059
2012-01-01
Abstract: In this paper, we propose an utterance-level latent topic transition model for estimating the latent topics behind utterances, and test the performance of this model in extractive speech summarization. The model estimates the latent topic weights behind each utterance, and these topic weights evolve from one utterance to the next in a spoken document according to a topic transition function represented by a matrix. We explore different ways of obtaining the topic transition matrices used in the model, and find that a set of matrices estimated from utterances clustered over a training set of spoken documents is very useful. In preliminary experiments on speech summarization, the model offered additional performance improvement when used together with the widely used Probabilistic Latent Semantic Analysis (PLSA).
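The abstract describes topic weights that evolve from one utterance to the next through a transition matrix. The sketch below illustrates that idea only, under assumptions not stated in the abstract: the update is taken to be a plain vector-matrix product followed by renormalization, the matrix is assumed row-stochastic, and the function name propagate_topics and the two-topic numbers are hypothetical. The paper's actual formulation and estimation procedure may differ.

```python
def propagate_topics(weights, transition):
    """Advance topic weights by one utterance (assumed linear update).

    next[j] = sum_i weights[i] * transition[i][j], then renormalized
    so the weights sum to 1. Both the linear form and the
    normalization are illustrative assumptions.
    """
    n = len(weights)
    nxt = [sum(weights[i] * transition[i][j] for i in range(n)) for j in range(n)]
    total = sum(nxt)
    if total == 0:
        raise ValueError("topic weights sum to zero after transition")
    return [v / total for v in nxt]


# Hypothetical two-topic transition matrix: row i gives how the weight
# on topic i is redistributed at the next utterance.
transition = [
    [0.9, 0.1],
    [0.2, 0.8],
]

# Start with all weight on topic 0 and follow it across three utterances.
weights = [1.0, 0.0]
trajectory = [weights]
for _ in range(3):
    weights = propagate_topics(weights, transition)
    trajectory.append(weights)

for step, w in enumerate(trajectory):
    print(step, [round(x, 3) for x in w])
```

In this toy setting the weight drifts gradually from topic 0 toward topic 1, which is the kind of smooth utterance-to-utterance evolution the abstract refers to.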