Abstract: In-car speech enhancement is an application of microphone array speech enhancement to a particular acoustic environment. Speech enhancement inside moving cars remains an active research topic, with researchers developing modules that improve the quality and intelligibility of speech in cars. Passenger dialogue, the sound of other equipment, and a wide range of interference effects are major challenges for speech separation in the in-car environment. To overcome these challenges, a novel Beamforming-based Deep Learning Network (Bf-DLN) is proposed for speech enhancement. Initially, the captured microphone array signals are pre-processed using an adaptive beamforming technique, Linearly Constrained Minimum Variance (LCMV). Subsequently, the proposed method uses a time-frequency representation, the smoothed pseudo-Wigner-Ville distribution (SPWVD), to convert the pre-processed time-domain speech signals into images. A convolutional deep belief network (CDBN) extracts the most pertinent features from these transformed images. An Enhanced Elephant Herd Algorithm (EEHA) selects the desired source by eliminating the interference source. The experimental results demonstrate the effectiveness of the proposed strategy in removing background noise from the original speech signal. The proposed strategy outperforms existing methods in terms of PESQ, STOI, SSNRI, and SNR. The proposed Bf-DLN attains a maximum PESQ of 1.98, whereas the existing two-stage Bi-LSTM, DNN-C, and GCN models attain 1.82, 1.75, and 1.68, respectively. The PESQ of the proposed method is reported as 1.75%, 3.15%, and 4.22% better than the existing GCN, DNN-C, and Bi-LSTM techniques, respectively. The efficacy of the proposed method is validated by experiments.
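
The abstract names LCMV beamforming as the pre-processing step but gives no implementation details. As a minimal sketch of the textbook narrowband LCMV solution, w = R^{-1} C (C^H R^{-1} C)^{-1} f, the following Python (NumPy) example computes weights that pass a target direction with unit gain and place a null on one interferer. The uniform linear array, the microphone spacing, the frequency, the source angles, the diagonal loading, and all function names are illustrative assumptions, not values or code from the paper.

```python
import numpy as np

def steering_vector(n_mics, spacing, angle_deg, freq, c=343.0):
    """Far-field steering vector of a uniform linear array (assumed geometry)."""
    positions = np.arange(n_mics) * spacing
    delays = positions * np.sin(np.deg2rad(angle_deg)) / c
    return np.exp(-2j * np.pi * freq * delays)

def lcmv_weights(R, C, f, loading=1e-6):
    """Minimise w^H R w subject to C^H w = f (textbook closed form).

    A small diagonal loading term (an assumption, added for numerical
    stability) is applied to R before inversion.
    """
    n = R.shape[0]
    R_loaded = R + loading * np.trace(R).real / n * np.eye(n)
    Rinv_C = np.linalg.solve(R_loaded, C)   # R^{-1} C
    gram = C.conj().T @ Rinv_C              # C^H R^{-1} C
    return Rinv_C @ np.linalg.solve(gram, f)

# Hypothetical scenario: 4 microphones, 5 cm apart, one 1 kHz frequency bin.
n_mics, spacing, freq = 4, 0.05, 1000.0
a_target = steering_vector(n_mics, spacing, 0.0, freq)    # assumed talker direction
a_interf = steering_vector(n_mics, spacing, 40.0, freq)   # assumed interferer direction

# Synthetic covariance: a strong interferer plus spatially white sensor noise.
R = 10.0 * np.outer(a_interf, a_interf.conj()) + 0.1 * np.eye(n_mics)

C = np.column_stack([a_target, a_interf])  # constraint matrix
f = np.array([1.0, 0.0])                   # unit gain on target, null on interferer

w = lcmv_weights(R, C, f)
response = C.conj().T @ w                  # array response in the constrained directions
print(np.round(np.abs(response), 6))
```

In a wideband speech setting, a sketch like this would typically be applied per frequency bin of a short-time Fourier transform, with R estimated from the recorded microphone signals rather than built synthetically as it is here.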

Multi-channel Speech Enhancement in Driving Environment

Speech Enhancement with Generalized Sidelobe Canceller Based on a Coherence-based Filter for Small Microphone Arrays

GSC-like Speech Enhancement for Dual Small Microphone Array

A GSC Algorithm Based on Null Spectral Subtraction for Dual Small Microphone Array Speech Enhancement

Low-SNR Speech Enhancement and Separation in Driving Environment

Microphone array processing via joint wideband angle-of-arrival estimation and speech feature enhancement

Service Robot Speech Enhancement Method Using Acoustic Micro-Sensor Array

Speech Enhancement Using an Equivalent Source Inverse Filtering-Based Microphone Array

Speech Enhancement with a GSC-like Structure Employing Sparse Coding

Speech Enhancement by Denoising and Dereverberation Using a Generalized Sidelobe Canceller-Based Multichannel Wiener Filter

Research of Speech Enhancement Based on Microphone Array

A Dual Microphone Speech Enhancement Method With A Smoothing Parameter Mask

Deep neural network-based generalized sidelobe canceller for dual-channel far-field speech recognition

Distributed-microphones based in-vehicle speech enhancement via sparse and low-rank spectrogram decomposition

Microphone Array System Based on Robust Adaptive Beamforming

Microphone Array Speech Enhancement Via Beamforming Based Deep Learning Network

GSVD-based Generalized Sidelobe Canceller

A Dual-Microphone Speech Enhancement Algorithm for Close-Talk System

FDM Array-Based Dual Channel Speech Enhancement Method

Robust Adaptive Microphone Array Algorithm for Speech Recognition

Exploring Conventional Enhancement and Separation Methods for Multi‐speech Enhancement in Indoor Environments