Bionic Cepstral coefficients (BCC): A new auditory feature extraction to noise-robust speaker identification

Youssef Zouhir,Mohamed Zarka,Kaïs Ouni
DOI: https://doi.org/10.1016/j.apacoust.2024.110026
IF: 3.614
2024-04-20
Applied Acoustics
Abstract:Automatic speaker recognition (ASR) faces a major challenge in achieving robust performances, despite humans being able to identify speakers accurately even in noisy environments. By studying the human auditory pathway's anatomy and function, researchers aimed to enhance the performance of conventional methods in noisy environments. The present paper introduces a Bionic Cepstral Coefficients (BCC) approach, which is a feature extraction method for an accurate speaker recognition using Bionic Wavelet FilterBank (BWFB) with ERB-rate scale. The BCC approach applies a cochlea inspired non-linear auditory model to the output of the OMLSA algorithm using IMCRA. Experiments were carried out on clean signals from the TIMIT database augmented with various AURORA noises to assess the proposed BCC approach's performances. Next, we compared BCC's performance to conventional auditory feature extraction approaches, such as MFCC, HFCC, and PNCC. According to the obtained results, the BCC approach performs better than conventional methods, even in noisy environments.
acoustics
What problem does this paper attempt to address?