NeuralKalman: A Learnable Kalman Filter for Acoustic Echo Cancellation

Yixuan Zhang, Meng Yu, Hao Zhang, Dong Yu, DeLiang Wang
2023-12-26
Abstract: The robustness of the Kalman filter to double talk and its rapid convergence make it a popular approach for addressing acoustic echo cancellation (AEC) challenges. However, the inability to model nonlinearity and the need to tune control parameters cast limitations on such adaptive filtering algorithms. In this paper, we integrate the frequency domain Kalman filter (FDKF) and deep neural networks (DNNs) into a hybrid method, called NeuralKalman, to leverage the advantages of deep learning and adaptive filtering algorithms. Specifically, we employ a DNN to estimate nonlinearly distorted far-end signals, a transition factor, and the nonlinear transition function in the state equation of the FDKF algorithm. Experimental results show that the proposed NeuralKalman improves the performance of FDKF significantly and outperforms strong baseline methods.
Audio and Speech Processing, Sound
What problem does this paper attempt to address?
The paper addresses open problems in acoustic echo cancellation (AEC). Its goal is a hybrid model that adds deep neural networks (DNNs) to the conventional frequency-domain Kalman filter (FDKF), so that the strengths of both are combined. The research focuses on three limitations:

1. **Nonlinear Distortion**: Traditional adaptive filtering algorithms cannot model the nonlinear distortion introduced by loudspeakers.
2. **Covariance Estimation**: In Kalman filtering, the covariances of the state noise and the observation noise are usually approximated, which can degrade the performance of the algorithm.
3. **Transition Factor**: In the state equation of the Kalman filter, the transition factor is usually assigned a fixed value, which makes it hard to adapt to environmental changes.

To address these issues, the paper proposes the NeuralKalman framework, which uses a DNN to estimate the nonlinearly distorted far-end signal, the transition factor, and the nonlinear transition function in the state equation. Experimental results show that NeuralKalman is more robust in double-talk scenarios and outperforms the baseline methods in echo suppression.
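To make the roles of the transition factor and the two noise covariances concrete, the following is a minimal sketch of a conventional Kalman filter for a single frequency bin with a one-tap echo path. It is not the NeuralKalman model or the paper's FDKF implementation. The function names, the scalar state, and the values of `a`, `q`, and `r` are all assumptions chosen for illustration. In this simplified picture, `a` is the fixed transition factor, `q` and `r` are the hand-set noise variances, and `x` is the far-end reference, which are the kinds of quantities the summary above says are approximated, fixed, or distorted in conventional filters.

```python
import numpy as np


def kalman_step(w, p, x, y, a=0.999, q=1e-4, r=2e-4):
    """One update/predict step of a scalar complex Kalman filter (one bin).

    Assumed model (illustrative, not taken from the paper):
        state:        w[k+1] = a * w[k] + state noise with variance q
        observation:  y[k]   = x[k] * w[k] + near-end signal with variance r
    w: prior echo-path estimate, p: prior error variance,
    x: far-end reference, y: microphone signal.
    """
    e = y - x * w                               # prior error (echo-cancelled output)
    k = p * np.conj(x) / (p * abs(x) ** 2 + r)  # Kalman gain
    w_post = w + k * e                          # posterior echo-path estimate
    p_post = (1.0 - (k * x).real) * p           # posterior error variance
    w_next = a * w_post                         # predict with the transition factor
    p_next = a ** 2 * p_post + q                # propagate the variance
    return w_next, p_next, e


def simulate(num_frames=500, seed=0):
    """Track a fixed, hypothetical echo path from noisy observations."""
    rng = np.random.default_rng(seed)
    w_true = 0.8 - 0.3j                         # hypothetical true echo path
    w, p = 0.0 + 0.0j, 1.0                      # initial estimate and variance
    for _ in range(num_frames):
        x = rng.standard_normal() + 1j * rng.standard_normal()
        s = 0.01 * (rng.standard_normal() + 1j * rng.standard_normal())
        y = x * w_true + s                      # linear echo plus weak near-end noise
        w, p, _ = kalman_step(w, p, x, y)
    return w_true, w


if __name__ == "__main__":
    w_true, w_est = simulate()
    print("true path:", w_true, "estimate:", w_est)
```

Because the echo here is generated as a purely linear function of `x`, the filter converges. With a nonlinearly distorted loudspeaker signal, the linear observation model no longer holds, and a fixed `a` cannot follow a changing echo path, which is the gap the paper proposes to fill with DNN estimates.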