An enhanced RASTA filtering of speech

甄斌,吴玺宏,刘志敏,迟惠生
DOI: https://doi.org/10.3321/j.issn:0371-0025.2001.03.011
2001-01-01
Abstract:We propose an Enhanced RASTA (E_RASTA) technique for speech and speaker recognition. The new method consists of classical RASTA filtering in logarithmic spectrum domain following by another RASTA processing in spectrum domain. In this manner, both the channel distortion and additive noise are removed effectively. In isolated digit speaker identification and speech recognition experiment on TI46 database, we found that the E_RASTA performed equally or better than J_RASTA method in both tasks. The E_RASTA does not need the speech SNR estimation in order to determinate the optimal value of J in J_RASTA, and the information of how the speech degrades. The choice of E_RASTA filters also indicates that the low frequency modulation components in degraded speech can deteriorate the performance of both recognition tasks. Besides, the speaker recognition needs less temporal modulation frequency band than the speech recognition.
What problem does this paper attempt to address?