Attentional Multi-Feature Fusion for Spoofing-Aware Speaker Verification

Qian Shen, Mengxi Guo, YiDa Huang, Jianfen Ma
DOI: https://doi.org/10.1007/s10772-024-10112-w
2024-01-01
International Journal of Speech Technology
Abstract: A Spoofing-Aware Speaker Verification (SASV) system protects automatic speaker verification (ASV) from speech spoofing attacks by integrating an ASV system with a countermeasure system. Optimizing the ASV system can further strengthen the SASV system's resistance to various spoofing methods. This paper therefore proposes an Attentional Multi-Feature Fusion framework that enriches the speech feature content available to the ASV system, with the aim of mitigating security vulnerabilities. For feature modeling, we introduce the Conformer module, which combines convolutional neural networks and Transformers to capture both local and global features while extracting fixed-dimensional speaker embedding vectors. The experimental results demonstrate that the proposed ASV system achieves a nearly 25