Abstract: In-car speech enhancement is an application of microphone array speech enhancement to a particular acoustic environment. Speech enhancement inside moving cars remains an active research topic, with work aimed at improving both the quality and the intelligibility of speech. Passenger dialogue, the sound of other equipment, and a wide range of interference effects are the major challenges for speech separation in the in-car environment. To address these challenges, a novel Beamforming-based Deep Learning Network (Bf-DLN) is proposed for speech enhancement. First, the captured microphone array signals are pre-processed with an adaptive beamforming technique, Linearly Constrained Minimum Variance (LCMV). Next, the pre-processed time-domain speech is converted into a time-frequency image using the smoothed pseudo-Wigner-Ville distribution (SPWVD). A convolutional deep belief network (CDBN) then extracts the most pertinent features from these images. Finally, the Enhanced Elephant Herd Algorithm (EEHA) selects the desired source by eliminating the interference source. Experimental results demonstrate the effectiveness of the proposed strategy in removing background noise from the original speech signal: it outperforms existing methods in terms of PESQ, STOI, SSNRI, and SNR. The proposed Bf-DLN reaches a maximum PESQ of 1.98, whereas the existing two-stage Bi-LSTM, DNN-C, and GCN models reach 1.82, 1.75, and 1.68, respectively. The PESQ of the proposed method is reported as 1.75%, 3.15%, and 4.22% better than the existing GCN, DNN-C, and Bi-LSTM techniques, respectively.
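
The LCMV beamforming step mentioned in the abstract can be illustrated with a minimal narrowband sketch. The abstract gives no array geometry, constraint set, or covariance estimator, so everything below is an assumption made for illustration: a 4-microphone uniform linear array with 5 cm spacing, a single frequency bin at 1 kHz, one distortionless constraint toward the target and one null toward an interferer, and a synthetic covariance matrix. The function names `lcmv_weights` and `steering_vector` are hypothetical helpers, not part of the paper. The weights follow the standard closed form $w = R^{-1} C (C^H R^{-1} C)^{-1} f$.

```python
import numpy as np

def lcmv_weights(R, C, f, diag_load=1e-6):
    """Standard LCMV solution: w = R^{-1} C (C^H R^{-1} C)^{-1} f.

    R: (M, M) Hermitian covariance matrix of the microphone signals.
    C: (M, K) constraint matrix (one steering vector per column).
    f: (K,) desired response for each constraint.
    diag_load: small relative diagonal loading (an assumed regularizer).
    """
    M = R.shape[0]
    R_loaded = R + diag_load * np.trace(R).real / M * np.eye(M)
    Rinv_C = np.linalg.solve(R_loaded, C)       # R^{-1} C, shape (M, K)
    gram = C.conj().T @ Rinv_C                  # C^H R^{-1} C, shape (K, K)
    return Rinv_C @ np.linalg.solve(gram, f)    # weights, shape (M,)

def steering_vector(n_mics, spacing_m, angle_rad, freq_hz, c=343.0):
    """Far-field steering vector for a uniform linear array (assumed geometry)."""
    positions = np.arange(n_mics) * spacing_m
    delays = positions * np.sin(angle_rad) / c
    return np.exp(-2j * np.pi * freq_hz * delays)

# Assumed scenario: target at broadside (0 degrees), interferer at 50 degrees.
n_mics, freq = 4, 1000.0
a_target = steering_vector(n_mics, 0.05, np.deg2rad(0.0), freq)
a_interf = steering_vector(n_mics, 0.05, np.deg2rad(50.0), freq)

# Constraints: unit gain toward the target, zero gain toward the interferer.
C = np.stack([a_target, a_interf], axis=1)
f = np.array([1.0, 0.0], dtype=complex)

# Synthetic covariance: strong interferer plus spatially white sensor noise.
R = 4.0 * np.outer(a_interf, a_interf.conj()) + 0.1 * np.eye(n_mics)

w = lcmv_weights(R, C, f)
target_gain = w.conj() @ a_target   # beamformer response toward the target
interf_gain = w.conj() @ a_interf   # beamformer response toward the interferer
```

In a wideband speech system this computation would typically be repeated per STFT frequency bin, with the covariance estimated from the recorded signals rather than constructed synthetically as here.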

CNN-Based Virtual Microphone Signal Estimation for MPDR Beamforming in Underdetermined Situations

Neural network-based virtual microphone estimation with virtual microphone and beamformer-level multi-task loss

Enhancing mmWave Beam Prediction Through Deep Learning with Sub-6 GHz Channel Estimate

Design of a robust MVDR beamforming method with Low-Latency by reconstructing covariance matrix for speech enhancement

Attention-Based Beamformer For Multi-Channel Speech Enhancement

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Neural Spatio-Temporal Beamformer for Target Speech Separation

A circular microphone array with virtual microphones based on acoustics-informed neural networks

A High-Resolution and Low-Frequency Acoustic Beamforming Based on Bayesian Inference and Non-Synchronous Measurements

Adaptive Beamforming Based on Interference-Plus-Noise Covariance Matrix Reconstruction for Speech Separation

Subspace Hybrid MVDR Beamforming for Augmented Hearing

New Robust Adaptive Beamforming Method for Multipath Coherent Signal Reception

Subspace Hybrid Beamforming for Head-worn Microphone Arrays

Multichannel Speech Enhancement without Beamforming

Deep Learning Based Speech Beamforming

Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising

A Speech Enhancement Method Combining Beamforming with RNN for Hearing Aids

Microphone Array Speech Enhancement Via Beamforming Based Deep Learning Network

Unsupervised Improved MVDR Beamforming for Sound Enhancement

ADL-MVDR: All deep learning MVDR beamformer for target speech separation

Microphone Subset Selection for MVDR Beamformer Based Noise Reduction