Acoustic Scene Classification by Enhanced High-Frequency Weak Signal Characteristic Spectrum.

Wenjie Hao,Lasheng Zhao,Qiang Zhang
DOI: https://doi.org/10.1145/3373509.3373511
2019-01-01
Abstract:In the acoustic scene classification task, the method of using mel-spectrogram to express the acoustic scene information is widely applied. However, mel-spectrogram has defects and it ignores important information about some acoustic scenes. The paper improves the mel-spectrogram and its generation algorithm. Including: I. For the sensitivity of the acoustic scene to high-frequency acoustic signals, the paper changes the filter design method of Mel Frequency Cepstrum Coefficient (MFCC). This method preserves more high frequency information by applying the equal-height triangular filter banks and increasing the number of the filters. II. Based on the previous step, an enhancement algorithm is proposed for the problem of the lack of high-frequency weak signals in the characteristic spectrum. The algorithm performs nonlinear mapping on the mel-spectrogram, which makes the transformed high-frequency weak signal feature information more obvious. The algorithm is verified by DCASE 2018 acoustic scene classification dataset and LITIS ROUEN dataset. The experimental results demonstrate the effectiveness of the proposed algorithm.
What problem does this paper attempt to address?