Voice spoofing detection using a neural networks assembly considering spectrograms and mel frequency cepstral coefficients

Carlos Alberto Hernández-Nava,Eric Alfredo Rincón-García,Pedro Lara-Velázquez,Sergio Gerardo de-Los-Cobos-Silva,Miguel Angel Gutiérrez-Andrade,Roman Anselmo Mora-Gutiérrez
DOI: https://doi.org/10.7717/peerj-cs.1740
2023-12-18
Abstract:Nowadays, biometric authentication has gained relevance due to the technological advances that have allowed its inclusion in many daily-use devices. However, this same advantage has also brought dangers, as spoofing attacks are now more common. This work addresses the vulnerabilities of automatic speaker verification authentication systems, which are prone to attacks arising from new techniques for the generation of spoofed audio. In this article, we present a countermeasure for these attacks using an approach that includes easy to implement feature extractors such as spectrograms and mel frequency cepstral coefficients, as well as a modular architecture based on deep neural networks. Finally, we evaluate our proposal using the well-know ASVspoof 2017 V2 database, the experiments show that using the final architecture the best performance is obtained, achieving an equal error rate of 6.66% on the evaluation set.
What problem does this paper attempt to address?