Abstract:In current scenario, speaker recognition under noisy condition is the major challenging task in the area of speech processing. Due to noise environment there is a significant degradation in the system performance. The major aim of the proposed work is to identify the speaker's under clean and noise background using limited dataset. In this paper, we proposed a multitaper based Mel frequency cepstral coefficients (MFCC) and power normalization cepstral coefficients (PNCC) techniques with fusion strategies. Here, we used MFCC and PNCC techniques with different multitapers to extract the desired features from the obtained speech samples. Then, cepstral mean and variance normalization (CMVN) and Feature warping (FW) are the two techniques applied to normalize the obtained features from both the techniques. Furthermore, as a system model low dimension i-vector model is used and also different fusion score strategies like mean, maximum, weighted sum, cumulative and concatenated fusion techniques are utilized. Finally extreme learning machine (ELM) is used for classification in order to increase the system identification accuracy (SIA) intern which is having a single layer feedforward neural network with less complexity and time consuming compared to other neural networks. TIMIT and SITW 2016 are the two different databases are used to evaluate the proposed system under limited data of these databases. Both clean and noisy backgrounds conditions are used to check the SIA.

Speaker Identification Using Time-delay Hmes.

Speaker Identification Based on the Time-Delay Hierarchical Mixture of Experts

Combine Multiple Time-Delay HMEs for Speaker Identification

Learning Virtual HD Model for Bi-model Emotional Speaker Recognition

Emotional Speaker Identification By Humans And Machines

Real-time Speaker Recognition System for PDA

An Hmm/Mfnn Hybrid Architecture Based On Stacked Generalization For Speaker Identification

Text-dependent Speaker Identification Based on Input/output HMMs: an Empirical Study

Efficient Speaker Recognition Based on Multi-class Twin Support Vector Machines and GMMs

Methods of Combining Multiple Classifiers with Different Features and Their Applications to Text-Independent Speaker Identification

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification

Hybrid Architecture Based on Fuzzy Classifier and Multiplayer Feed-Forward Neural Network for Speaker Identification

Eigenvoice Factor Analysis in Short Time Speaker Recognition

A fused hidden Markov model with application to bimodal speech processing

Short Time Speaker Recognition Method Based on Common Feature Selection

ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score

Robust Time-Delay Estimation for Speaker Localization Using Mutual Information Among Multiple Microphone Signals

The Cohort-Selection And Normalized Hidden Markov Model For Speaker Recognition

Speaker Identification based on LSP and Gaussian Mixture Model

Wavelet-Based Mel-Frequency Cepstral Coefficients for Speaker Identification using Hidden Markov Models