An Efficient and Interpre Table Speech Enhancement Network Via Deep Dictionary Learning.

Xinmeng Xu,Yiqun Zhang,Weiping Tu,Yuhong Yang
DOI: https://doi.org/10.1109/ICASSP48485.2024.10447188
2024-01-01
Abstract:Speech enhancement is a vital and highly ill-posed problem for many speech downstream tasks. While currently existing deep learning based speech enhancement methods have held state-of-the-art results, they still possess apparent shortcomings in that most of the deep learning based models lack interpretability. This deficiency results in unsatisfied speech enhancement performance in many sophisticated scenarios. To tackle this problem, we integrate dictionary learning and sparse coding into deep learning networks for speech enhancement and present a deep dictionary learning based speech enhancement network (DicLSENet). Specifically, the proposed DicLSENet strictly follows the principle of dictionary learning, learns the priors for both representation coefficients and dictionaries, and adaptively adjusts the dictionary for each input. Experimental results show that the proposed model outperforms state-of-the-art fully deep learning based methods with attractive computational costs.
What problem does this paper attempt to address?