Advanced transient noise reduction in speech signals via semi-supervised signal fusion

Rizwan Ullah,Shaohui Zhang
DOI: https://doi.org/10.1016/j.apacoust.2024.109882
IF: 3.614
2024-01-29
Applied Acoustics
Abstract:Speech enhancement (SE) by transient noise suppression refers to the process of estimating the desired signal from a transient noise (TN)-corrupted speech signal. It has important applications in speaker verification and identification, voice-based biometric systems, hearing aids, video conferences, and many others. Enhancement of TN-corrupted signals is both important and challenging due to their high randomness, high short-time energy, and wide frequency domain distribution. This paper presents a novel approach to SE that leverages the power of sequential sparse nonnegative matrix factorization (SNMF), enhanced dictionary learning (DL), and Gini index (GI)-based fusion. We use a multi-step approach to enhance noisy speech. Our approach uses semi-supervised learning since the TN used for dictionary training is derived from the noisy signal using optimally-modified log-spectral amplitude (OMLSA) and is then purified. We involve both the dictionaries of the noisy signal and external clean speech in the DL process to merge the strengths of both dictionaries. Firstly, the dictionary of the noisy signal is obtained through SNMF and is then decomposed into speech-dominant and noise-dominant submatrices using semi-supervised learning. Then this speech-dominant submatrix is combined with the clean speech dictionary to construct an improved speech dictionary. This enhanced dictionary and the external clean speech dictionary are then used for the SE in the testing phase, resulting in two different initial estimated signals. To improve the accuracy of the initial estimates, we apply the GI to obtain the final estimate. Furthermore, the noisy signal's phase is also enhanced. A significant increase is observed in the results through different evaluation measures.
acoustics
What problem does this paper attempt to address?