Abstract:Speaker Verification (SV) systems involve mainly two individual stages: feature extraction and classification. In this paper, we explore these two modules with the aim of improving the performance of a speaker verification system under noisy conditions. On the one hand, the choice of the most appropriate acoustic features is a crucial factor for performing robust speaker verification. The acoustic parameters used in the proposed system are: Mel Frequency Cepstral Coefficients (MFCC), their first and second derivatives (Deltas and Delta- Deltas), Bark Frequency Cepstral Coefficients (BFCC), Perceptual Linear Predictive (PLP), and Relative Spectral Transform - Perceptual Linear Predictive (RASTA-PLP). In this paper, a complete comparison of different combinations of the previous features is discussed. On the other hand, the major weakness of a conventional Support Vector Machine (SVM) classifier is the use of generic traditional kernel functions to compute the distances among data points. However, the kernel function of an SVM has great influence on its performance. In this work, we propose the combination of two SVM-based classifiers with different kernel functions: Linear kernel and Gaussian Radial Basis Function (RBF) kernel with a Logistic Regression (LR) classifier. The combination is carried out by means of a parallel structure approach, in which different voting rules to take the final decision are considered. Results show that significant improvement in the performance of the SV system is achieved by using the combined features with the combined classifiers either with clean speech or in the presence of noise. Finally, to enhance the system more in noisy environments, the inclusion of the multiband noise removal technique as a preprocessing stage is proposed.

A PCA Method Based on Speaker Session Variability

Maximum Likelihood I-Vector Space Using PCA for Speaker Verification.

Session Variability Subspace Projection Based Model Compensation for Speaker Verification

Accelerate Training By Dwt In Speaker Identification Using Svms

Experimental evaluation of a new speaker identification framework using PCA.

Channel Compensation Technology In Differential Gsv-Svm Speaker Verification System

Compensation of Intrinsic Variability with Factor Analysis Modeling for Robust Speaker Verification

Improving Speaker Verification Performance Against Long-Term Speaker Variability

Exploiting PCA classifiers to speaker recognition

Intrinsic Variation Robust Speaker Verification Based on Sparse Representation.

Using MMSE to Improve Session Variability Estimation

Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification

A Novel I-Vector Framework Using Multiple Features and PCA for Speaker Recognition in Short Speech Condition

Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition.

A speaker verification backend with robust performance across conditions

Exploring Sequential Characteristics in Speaker Bottleneck Feature for Text-Dependent Speaker Verification.

Improved multitaper PNCC feature for robust speaker verification

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

Learning from human perception to improve automatic speaker verification in style-mismatched conditions

Local Pairwise Linear Discriminant Analysis for Speaker Verification

Improving Short Utterance PLDA Speaker Verification using SUV Modelling and Utterance Partitioning Approach