Abstract: We present a new speech enhancement scheme for a single-microphone system to meet the demand for high-quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized time-frequency subtraction algorithm, which exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time-frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Rigorous objective and subjective evaluations show that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments, and that its overall performance is superior to several competitive methods.
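The idea of letting a masking threshold steer the subtraction parameters can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the transform coefficients, a noise magnitude estimate, and a masking threshold are already available as arrays, and it skips the perceptual wavelet packet transform and the Bark and temporal spreading computations entirely. The function names, the linear threshold-to-parameter mapping, and the ranges `alpha_min`, `alpha_max`, `beta_min`, `beta_max` are all hypothetical choices made for illustration.

```python
import numpy as np


def adapt_parameters(mask_threshold, alpha_min=1.0, alpha_max=6.0,
                     beta_min=0.0, beta_max=0.02):
    """Map a masking threshold to per-coefficient subtraction parameters.

    Assumption: a linear mapping. Where the threshold is high, residual
    noise is more likely to be masked, so less aggressive subtraction
    (small alpha) and a lower noise floor (small beta) are used.
    """
    t = np.asarray(mask_threshold, dtype=float)
    t_min, t_max = t.min(), t.max()
    if t_max == t_min:
        # Flat threshold: fall back to the most aggressive setting.
        w = np.zeros_like(t)
    else:
        w = (t - t_min) / (t_max - t_min)  # normalized to [0, 1]
    alpha = alpha_max - w * (alpha_max - alpha_min)
    beta = beta_max - w * (beta_max - beta_min)
    return alpha, beta


def generalized_subtraction(coeffs, noise_mag, alpha, beta, gamma=2.0):
    """Generalized subtraction applied to real-valued coefficients.

    |S|^gamma = max(|Y|^gamma - alpha * |N|^gamma, beta * |N|^gamma),
    with the sign of each noisy coefficient retained.
    """
    coeffs = np.asarray(coeffs, dtype=float)
    noise_pow = np.asarray(noise_mag, dtype=float) ** gamma
    diff = np.abs(coeffs) ** gamma - alpha * noise_pow
    floor = beta * noise_pow
    mag = np.maximum(diff, floor) ** (1.0 / gamma)
    return np.sign(coeffs) * mag


# Toy usage with made-up numbers (not data from the paper).
noisy = np.array([4.0, -0.5, 2.0])
noise = np.array([1.0, 1.0, 1.0])
threshold = np.array([0.0, 0.5, 1.0])

alpha, beta = adapt_parameters(threshold)
enhanced = generalized_subtraction(noisy, noise, alpha, beta)
```

In this toy case the second coefficient lies below the scaled noise estimate and is clamped to the noise floor, while the third, where the assumed threshold is highest, is subtracted only lightly.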

Speech enhancement based on soft audible noise masking and noise power estimation

Speech Enhancement Based on Masking Properties and Short-Time Spectral Amplitude Estimation

Speech Enhancement Based on Short-Time Spectral Amplitude Estimates in Low SNR

Speech enhancement algorithm based on noise estimation of binary masking

Speech Enhancement Based on Noise Estimation and Auditory Masking Properties

Study of Speech Enhancement Algorithm Based on Auditory Masking Effect

A speech enhancement method based on wavelet packet and hearing masking effect

A Speech Enhancement Method Based on Signal Subspace and Hearing Masking Effect

A Speech Enhancement Algorithm Based On Non-Linear Filtering And Noise Masking

Speech Enhancement based on Human Auditory Masking Properties under Non-stationary Environments

Single-Channel Speech Enhancement Based on Psychoacoustic Masking

A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system

An Improved Method for Speech Enhancement Based on Human Auditory Masking Properties

Speech Enhancement Algorithm Based on Auditory Masking Effect and Optimal Smoothing

A Speech Enhancement Method Based on Noise Estimation with Rapid Adaptation

A Modified Spectral Subtraction Method For Speech Enhancement Based On Masking Property Of Human Auditory System

Noise Estimation Using Mean Square Cross Prediction Error for Speech Enhancement

A model-based soft decision approach for speech enhancement

A More Effective Speech Enhancement Algorithm under Non-Stationary Noise Environment

Speech Enhancement Technique on Poor Noise Estimation

MMSE Speech Enhancement Algorithm Based on Masking Properties