Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning

Hao Li,DeLiang Wang,Xueliang Zhang,Guanglai Gao
DOI: https://doi.org/10.21437/interspeech.2020-2475
2020-01-01
Abstract:This study investigates deep learning based signal-to-noise ratio (SNR) estimation at the frame level. We propose to employ recurrent neural networks (RNNs) with long short-term memory (LSTM) in order to leverage contextual information for this task. As acoustic features are important for deep learning algorithms, we also examine a variety of monaural features and investigate feature combinations using Group Lasso and sequential floating forward selection. By replacing LSTM with bidirectional LSTM, the proposed algorithm naturally leads to a long-term SNR estimator. Systematical evaluations demonstrate that the proposed SNR estimators significantly outperform other frame-level and long-term SNR estimators.
What problem does this paper attempt to address?