Design of an Ultra-Low Power MFCC Feature Extraction Circuit with Embedded Speech Activity Detector

Kaiyue Yang, Lixuan Zhu, Weiwei Shan
DOI: https://doi.org/10.1109/icta53157.2021.9661980
2021-01-01
Abstract: Mel-frequency cepstral coefficients (MFCC) are widely used for feature extraction from speech signals. In practical applications, the feature extraction circuit does not need to run during non-speech intervals. By extracting and reusing the data from the framing operation in MFCC, we propose a speech activity detector (SAD) that uses the amplitude and zero-crossing rate of the speech signal to distinguish its different states. Because the SAD relies only on time-domain characteristics of the signal, it lends itself to a low-power design. Its recognition rate is 95.1% in a noiseless environment and 94.6% at a signal-to-noise ratio of 20 dB. Designed in a TSMC 28 nm CMOS process and operating at 40 kHz with a 0.41 V supply, the circuit shows in simulation a power of 0.007/0.34 at non-speech and full-speech segments, respectively. The long-term average power is only 24 in a typical application with an event rate of 720/h.
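The time-domain detection idea in the abstract can be illustrated with a small software sketch. This is not the paper's circuit: the function names, frame length, hop size, thresholds, and the decision rule (loud frame, or moderately loud frame with a high zero-crossing rate) are all assumptions chosen for illustration. It only shows how per-frame amplitude and zero-crossing rate might be computed from the same frames that an MFCC front end already produces.

```python
def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames (the framing step MFCC also needs)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def mean_abs_amplitude(frame):
    """Average absolute amplitude of one frame."""
    return sum(abs(x) for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_speech(samples, frame_len=256, hop=128,
                  amp_thresh=0.05, amp_floor=0.01, zcr_thresh=0.3):
    """Return one boolean per frame: True if the frame looks like speech.

    Hypothetical rule (an assumption, not taken from the paper):
    - a frame louder than amp_thresh counts as speech (typical of voiced sounds);
    - a quieter frame still counts if it exceeds amp_floor and has a high
      zero-crossing rate (typical of unvoiced sounds such as fricatives).
    """
    flags = []
    for frame in frame_signal(samples, frame_len, hop):
        amp = mean_abs_amplitude(frame)
        zcr = zero_crossing_rate(frame)
        flags.append(amp > amp_thresh or (amp > amp_floor and zcr > zcr_thresh))
    return flags
```

In a system following the abstract's approach, flags like these would gate the downstream MFCC stages so that they run only on frames marked as speech. The thresholds here are placeholders and would need tuning against real recordings and noise levels.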