Dialect Identification Using Spectral and Prosodic Features on Single and Ensemble Classifiers

Nagaratna B. Chittaragi,Ambareesh Prakash,Shashidhar G. Koolagudi
DOI: https://doi.org/10.1007/s13369-017-2941-0
IF: 2.807
2017-11-17
Arabian Journal for Science and Engineering
Abstract:In this paper, investigation of the significance of spectral and prosodic behaviors of speech signal has been carried out for dialect identification. Spectral features such as cepstral coefficients, spectral flux, and entropy are extracted from shorter frames. Prosodic attributes such as pitch, energy, and duration are derived from longer frames. IViE (Intonational Variations in English) speech corpus covering nine dialectal regions of British Isles has been considered, to evaluate the proposed approach. Since corpus is available in both read and semi-spontaneous modes, the influence of spectral and prosodic behavior over these datasets is distinguishably articulated. Further, two distinct classification algorithms, namely support vector machine (SVM) and an ensemble of decision trees along with the SVM are used for identification of nine dialects. Dialect discriminating information captured from both features are used for constructing feature vectors. Experiments have been conducted on individual and combinations of features. A better dialect recognition performance is observed with ensemble methods over a single independent SVM.
multidisciplinary sciences
What problem does this paper attempt to address?