Abstract:In this paper, we present a frequency band threshold based on wavelet transform (FBT) noise cancellation method. The noise cancellation is enable to improve on the articulation of the speech. Although the edge information of the speech is very important for recognition system to use, most traditional noise cancellation methods based on spectrum analysis smooth these edges of the original speech. We hope to get a noise cancellation method that keeps these edges information. We knew that the performance of edge detection based on wavelet transform is very high. So we use wavelet transform for noise cancellation. Noise cancellation methods based on wavelet transform were referred to papers (1)(2). The method was given by paper (1) is not real-time. Hence this method is difficult to be used a practical system. Although the real-time property of the noise cancellation method was referred to paper (2) is perfect, the aural performance is defective. This method has a single threshold (ST). It ignored the difference of the frequency bands. FBT is presented by us in this paper possesses two characteristics as follow: (1) These thresholds depend on frequency bands. (2) These thresholds are self-adjusting. Based on two judgement standards---signal noise rate (impersonal standard) and the articulation of the speech (subjective standard), we did comparison experiments between FBT and ST. Although FBT's signal noise rate inferior to the ST's, FBT's waveform distortion is less than ST's and FBT's articulation of the speech is remarkable superior to the ST's. We particularly analyzed the causes of the phenomena and did the comparison experiments of these two methods on the same speech recognition system. The conclusion is FBT is superior to ST.

Noise-robust speech recognition based on difference of power spectrum

Noise Estimation Using Mean Square Cross Prediction Error for Speech Enhancement

Robust speech recognition in noisy backgrounds based on Teager energy operator and auditory process

The predictive differential amplitude spectrum for robust speaker recognition in stationary noises

Robust Speech Recognition by Selecting Mel-Filter Banks

Modified MFCCs for Robust Speaker Recognition

A Robust Speech Feature - Perceptive Scalogram Based on Wavelet Analysis

speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition

Enhancement of Non-air Conduct Speech Based on Multi-band Spectral Subtraction Method

Research on Speech Recognition Methods Based on Spectral Subtraction and Improved BP Neural Network Algorithm

Speech Enhancement Algorithm Based on Spectral Subtraction

Effective Speech Endpoint Detection Algorithm For Voiceprint Recognition

Speech Enhancement Approach Based on Minimum Estimate and Spectral Subtraction

Speech Enhancement Based on Short-Time Spectral Amplitude Estimates in Low SNR

Research on Speech Enhancement Method Based on Fuzzy System

Robust Speech And Non-Speech Detection

A Speech Enhancement Algorithm Based On Non-Linear Filtering And Noise Masking

An Auditory Feature Extraction Method Based on Forward-Masking and Its Application in Robust Speaker Identification and Speech Recognition.

Analysis of noise robustness of auditory features in speech recognition

A NOISE CANCELLATION METHOD BASED ON WAVELET TRANSFORM

A Noise Robust Front End Algorithm for Mandarin Speech Recognition and Performance Analysis