Auditory Scene Analysis and Recognition with Lda Topic Model

Feng Su
DOI: https://doi.org/10.1109/icme.2014.6890241
2014-01-01
Abstract:Analysis and recognition of auditory scenes play an important role in content-based multimedia processing and context-aware applications. In this paper, we propose an auditory scene recognition scheme that integrates the analysis of the audio data of scene with LDA topic model to discover latent structures (i.e. contextual correlations) of audio words, and generation of intermediate contextual descriptions of audio data on basis of the topics learnt by LDA. We further combine the piecewise low-level audio feature and the contextual feature, and discriminatively classify an audio clip of an unknown scene that is represented as a set of these features using the Hough forest model. The experimental results demonstrate the effectiveness of the proposed scheme, which combines the unsupervised topic modeling by LDA and the supervised classification of auditory scene by Hough forest.
What problem does this paper attempt to address?