MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology

Ran Zhou,Shuai Zhao,Mingming Luo,Xin Meng,Jie Ma,Jianfei Liu
DOI: https://doi.org/10.1007/s11801-024-3167-5
2024-03-02
Optoelectronics Letters
Abstract:The distributed acoustic sensing technology was used for real-time speech reproduction and recognition, in which the voiceprint can be extracted by the Mel frequency cepstral coefficient (MFCC) method. A classic ancient Chinese poem "You Zi Yin", also called "A Traveler's Song", was analyzed both in time and frequency domains, where its real-time reproduction was achieved with a 116.91 ms time delay. The smaller scaled MFCC 0 at 1/12 of MFCC matrix was taken as a feature vector of each line against the ambient noise, which provides a recognition method via cross-correlation among the 6 original and recovered verse pairs. The averaged cross-correlation coefficient of the matching pairs is calculated to be 0.580 6 higher than 0.188 3 of the nonmatched pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.
optics
What problem does this paper attempt to address?