Abstract:This paper proposed a novel speech processing algorithm in cochlear implant, which used harmonicity cues to enhance tonal information in Mandarin Chinese speech recognition. The input speech was filtered by a 4-channel band-pass filter bank. The frequency ranges for the four bands were: 300-621, 621-1285, 1285-2657, and 2657-5499 Hz. In each pass band, temporal envelope and periodicity cues (TEPCs) below 400 Hz were extracted by full wave rectification and low-pass filtering. The TEPCs were modulated by a sinusoidal carrier, the frequency of which was fundamental frequency (F0) and its harmonics most close to the center frequency of each band. Signals from each band were combined together to obtain an output speech. Mandarin tone, word, and sentence recognition in quiet listening conditions were tested for the extensively used continuous interleaved sampling (CIS) strategy and the novel F0-harmonic algorithm. Results found that the F0-harmonic algorithm performed consistently better than CIS strategy in Mandarin tone, word, and sentence recognition. In addition, sentence recognition rate was higher than word recognition rate, as a result of contextual information in the sentence. Moreover, tone 3 and 4 performed better than tone 1 and tone 2, due to the easily identified features of the former. In conclusion, the F0-harmonic algorithm could enhance tonal information in cochlear implant speech processing due to the use of harmonicity cues, thereby improving Mandarin tone, word, and sentence recognition. Further study will focus on the test of the F0-harmonic algorithm in noisy listening conditions.

Harmonic Intensity Feature for Robust Speech Recognition

An Efficient Robust Asr System Based On The Combination Of Speech Enhancement And Hmm Adaptation

Harmonic Detection from Noisy Speech with Auditory Frame Gain for Intelligibility Enhancement

Robust Speech Recognition by Selecting Mel-Filter Banks

Modified MFCCs for Robust Speaker Recognition

Analysis of noise robustness of auditory features in speech recognition

Robust F0 Modeling for Mandarin Speech Recognition in Noise.

A Noise Robust Front End Algorithm for Mandarin Speech Recognition and Performance Analysis

On the Importance of Components of the MFCC in Speech and Speaker Recognition.

High Performance Digit Mandarin Speech Recognition

Robust Audio-Visual Mandarin Speech Recognition Based on Adaptive Decision Fusion and Tone Features

Harmonic and non-Harmonic Based Noisy Reverberant Speech Enhancement in Time Domain

Statistical Thresholding for Robust ASR

Accent Recognition with Hybrid Phonetic Features

Auditory Features Based on Gammatone Filters for Robust Speech Recognition.

Compensation of Speech Enhancement Distortion for Robust Speech Recognition

A Novel Speech Processing Algorithm Based on Harmonicity Cues in Cochlear Implant

Multi-resolution Time Frequency Feature and Complementary Combination for Short Utterance Speaker Recognition

Design and implementation of speech recognition algorithm based on frequency range

Robust speech recognition in noisy backgrounds based on Teager energy operator and auditory process

Comparing the Perceptual Contributions of Cochlear-Scaled Entropy and Speech Level