Research on Voiceprint Recognition Based on MFCC-PCA-LSTM

Yong Liang, Xiang Gong, Linlin Xiong, Zhenyu Liao
DOI: https://doi.org/10.1109/bdicn58493.2023.00032
2023-01-01
Abstract: Voiceprint recognition identifies a speaker by mining the individual traits in the speaker's physiological characteristics. It is non-contact, its data are easy to acquire, and its signal characteristics are stable. This paper uses deep learning networks to realize text-independent voiceprint recognition. First, the perceptual feature Mel-frequency cepstral coefficients (MFCC) and their first-order differential component ($\Delta$MFCC) are extracted and combined into a single feature vector. The extracted voiceprint feature matrix is then compressed by principal component analysis (PCA) and data normalization, and fed into a long short-term memory (LSTM) network and a bidirectional LSTM network for voiceprint target classification. Experiments on four algorithm models show that the (MFCC+$\Delta$MFCC)-PCA-LSTM model achieves the highest voiceprint recognition accuracy on the THCHS-30 data set, reaching 93%.
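The feature pipeline described in the abstract (MFCC plus its first-order difference, then normalization and PCA compression) can be sketched as follows. This is a minimal illustration, not the authors' implementation. The abstract does not give the frame count, the number of coefficients, the number of retained components, or the order of normalization and PCA, so the values and ordering here are assumptions. The MFCC matrix is a random placeholder, `np.gradient` is a simple stand-in for whatever delta computation the paper uses, and the helper names are hypothetical.

```python
import numpy as np

def delta(features):
    """First-order difference along the time axis (one frame per row)."""
    return np.gradient(features, axis=0)

def standardize(x, eps=1e-8):
    """Scale each feature column to zero mean and unit variance."""
    return (x - x.mean(axis=0)) / (x.std(axis=0) + eps)

def pca_compress(x, n_components):
    """Project x onto its top principal components using an SVD."""
    centered = x - x.mean(axis=0)
    # Rows of vt are principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

# Placeholder input: 200 frames of 13 MFCCs (assumed sizes, random data).
rng = np.random.default_rng(0)
mfcc = rng.normal(size=(200, 13))

# Combine MFCC with its delta, normalize, then compress with PCA.
combined = np.hstack([mfcc, delta(mfcc)])   # shape (200, 26)
compressed = pca_compress(standardize(combined), n_components=12)
```

In a full system, `compressed` would be split into fixed-length frame sequences and passed to the LSTM or bidirectional LSTM classifier; that stage is omitted here because the abstract does not describe the network configuration.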
Computer Science