A Speech Enhancement Method Based on Dual-Path Phase-Aware GAN Networks

Chenghao Zhuang, Yunling Cheng, Qirui Wang, Lin Zhou, Yanxiang Cao
DOI: https://doi.org/10.1109/ICCCAS62034.2024.10652640
2024-05-10
Abstract: In current speech enhancement research, estimating the complex spectrum of clean speech from noisy speech, either as real and imaginary parts or as amplitude and phase, has become the mainstream approach to improving the intelligibility and perceptual quality of reconstructed speech. However, the accuracy of phase estimation has been limited by phase unwrapping. To address this problem, this paper proposes a dual-path GAN structure in which one path is dedicated to estimating the amplitude of the speech spectrum and the other focuses on the phase. The two paths exchange information with each other, which improves the accuracy of both the amplitude and the phase estimates. We also propose a novel phase loss function that focuses on stable phase estimation while maintaining phase continuity. Simulation results show that the proposed speech enhancement algorithm significantly improves performance, which verifies the effectiveness of the method.
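The abstract does not give the form of the proposed phase loss, so the following is only a minimal sketch of why phase wrapping makes a plain phase distance misleading, and of one common workaround. The function names (`wrap_to_pi`, `naive_phase_loss`, `anti_wrapping_phase_loss`) and the toy values are assumptions for illustration, not the authors' loss.

```python
import math


def wrap_to_pi(x: float) -> float:
    """Map an angle in radians to its principal value in [-pi, pi)."""
    return (x + math.pi) % (2.0 * math.pi) - math.pi


def naive_phase_loss(est, ref):
    """Mean absolute difference of raw phase values (ignores wrapping)."""
    return sum(abs(e - r) for e, r in zip(est, ref)) / len(ref)


def anti_wrapping_phase_loss(est, ref):
    """Mean absolute difference after wrapping each error to [-pi, pi).

    Illustrative only: an assumed wrapping-aware distance,
    not the loss proposed in the paper.
    """
    return sum(abs(wrap_to_pi(e - r)) for e, r in zip(est, ref)) / len(ref)


# Two phase tracks that are almost the same angle but lie on
# opposite sides of the +/-pi boundary (toy values).
ref = [3.10, 3.12, 3.14]
est = [-3.14, -3.12, -3.10]

print("naive:", naive_phase_loss(est, ref))
print("anti-wrapping:", anti_wrapping_phase_loss(est, ref))
```

In this toy case the raw difference is close to 2π even though the angles nearly coincide, while the wrapped difference stays small; a loss that respects this periodicity avoids penalizing estimates that are correct up to a multiple of 2π.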
Computer Science