Multimedia application for forensic automatic speaker recognition from disguised voices using MFCC feature extraction and classification techniques
Mahesh K. Singh
DOI: https://doi.org/10.1007/s11042-024-18602-4
IF: 2.577
2024-02-24
Multimedia Tools and Applications
Abstract:In forensic automatic speaker recognition (FASR), voice disguise is a most important disquiet in the field of multimedia applications. It is most important to classify the disguised voice in order to recognise the speaker’s identity. Changing the pitch is one of the most popular types of voice disguises used by criminals. From the normal pitch to the change in raised pitch, lower pitch, fast pitch, and slow pitch, the most disguised type amongst the different types of disguised voice. The acoustic features in terms of mean and correlation coefficients for raised pitch and lower pitch have different coefficient values, similarly for fast and slow pitch. Using an innovative set of features, we have recognised the classification efficiency for raised pitch, lower pitch, fast speech, and slow speech disguised voice for female and male voice samples used in this proposed research work for multimedia applications. Mel-frequency cepstrum coefficient (MFCC), delta MFCC (ΔMFCC), and double delta MFCC (ΔΔMFCC) features are extracted from normal voice and entirely four varieties of disguised voice. The acoustic feature, correlation coefficients and mean are extracted from all voice samples using the three different MFCC feature extraction techniques in multimedia applications. Feature-based classifiers for example k-nearest neighbour (k-NN) and support vector machines (SVM) are used for classification and recognise the speaker’s classification efficiency. The classification results show that applying these feature-based SVM classifiers increases the classification efficiency to 98.58%, compared to 94.23% when using feature-based k-NN classifiers. It is observed that for the standard normal voice, there is 100% classification efficiency for both SVM and k-NN classifiers.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering