Linear Histogram Equalization in the Acoustic Feature Domain for Speech Recognition over Bluetooth™ Channels

Ke Peng,Hongbin Cai,Yaxin Zhang
DOI: https://doi.org/10.1145/1378063.1378130
2007-01-01
Abstract:This paper studies the improvement of speech recognition over Bluetooth™ wireless channels. Speech recognition over Bluetooth™ suffers from the low SNR due to the position of the Bluetooth™ microphone, Bluetooth™ codec distortion, packet loss over the wireless channel, and Bluetooth™ channel distortion. By transforming the MFCCs (Mel-Frequency Cepstral Coefficients) to make the cumulative density functions of the MFCC values in recognition match the ones that were estimated on the training data, the recognition can be improved. The cumulative density functions are approximated using a small number of quantiles. Recognition tests on a Bluetooth™ speech database showed significant increase of recognition accuracy in noisy environments.
What problem does this paper attempt to address?