Multichannel Speech Enhancement in Vehicle Environment Based on Interchannel Attention Mechanism

Xueli Shen,Zhenxing Liang,Shiyin Li,Yanji Jiang
DOI: https://doi.org/10.1155/2021/9453911
IF: 2.249
2021-01-01
Journal of Advanced Transportation
Abstract:Speech enhancement in a vehicle environment remains a challenging task for the complex noise. The paper presents a feature extraction method that we use interchannel attention mechanism frame by frame for learning spatial features directly from the multichannel speech waveforms. The spatial features of the individual signals learned through the proposed method are provided as an input so that the two-stage BiLSTM network is trained to perform adaptive spatial filtering as time-domain filters spanning signal channels. The two-stage BiLSTM network is capable of local and global features extracting and reaches competitive results. Using scenarios and data based on car cockpit simulations, in contrast to other methods that extract the feature from multichannel data, the results show the proposed method has a significant performance in terms of all SDR, SI-SNR, PESQ, and STOI.
What problem does this paper attempt to address?