Robust Audio Fingerprinting Based On Local Spectral Luminance Maxima Scheme

Yongzhe Shi,Weiqiang Zhang,Jia Liu
DOI: https://doi.org/10.21437/interspeech.2011-636
2011-01-01
Abstract:This paper proposes a robust audio fingerprinting system based on local spectral luminance maxima (LSLM) scheme using image processing approaches. Our approach treats spectrogram of an audio clip as a 2-D image and extracts the local luminance maxima of spectrum image as the discriminative characteristics. LSLM are selected due to resilience against quantization, compression, and noise addition, etc. Experimental results show that the proposed binary audio fingerprints outperform some of the state-of-the-art in the context of both robustness and reliability, especially in the noisy environment.
What problem does this paper attempt to address?