Abstract:Advances in automatic speaker verification (ASV) promote research into the formulation of spoofing detection systems for real-world applications. The performance of ASV systems can be degraded severely by multiple types of spoofing attacks, namely, synthetic speech (SS), voice conversion (VC), replay, twins and impersonation, especially in the case of unseen synthetic spoofing attacks. A reliable and robust spoofing detection system can act as a security gate to filter out spoofing attacks instead of having them reach the ASV system. A weighted additive angular margin loss is proposed to address the data imbalance issue, and different margins has been assigned to improve generalization to unseen spoofing attacks in this study. Meanwhile, we incorporate a meta-learning loss function to optimize differences between the embeddings of support versus query set in order to learn a spoofing-category-independent embedding space for utterances. Furthermore, we craft adversarial examples by adding imperceptible perturbations to spoofing speech as a data augmentation strategy, then we use an auxiliary batch normalization (BN) to guarantee that corresponding normalization statistics are performed exclusively on the adversarial examples. Additionally, A simple attention module is integrated into the residual block to refine the feature extraction process. Evaluation results on the Logical Access (LA) track of the ASVspoof 2019 corpus provides confirmation of our proposed approaches' effectiveness in terms of a pooled EER of 0.87%, and a min t-DCF of 0.0277. These advancements offer effective options to reduce the impact of spoofing attacks on voice recognition/authentication systems.

Masking Speech Feature to Detect Adversarial Examples for Speaker Verification

LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification

Understanding and Benchmarking the Commonality of Adversarial Examples

Spoofing Speaker Verification System by Adversarial Examples Leveraging the Generalized Speaker Difference.

Imperceptible Black-Box Waveform-Level Adversarial Attack Towards Automatic Speaker Recognition

Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition

Defending Against Adversarial Attacks in Speaker Verification Systems

Adversarial Sample Detection for Speaker Verification by Neural Vocoders

BypTalker: an Adaptive Adversarial Example Attack to Bypass Prefilter-enabled Speaker Recognition

Voiceprint Mimicry Attack Towards Speaker Verification System in Smart Home

Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition.

Defending Adversarial Attacks on Cloud-aided Automatic Speech Recognition Systems.

Defense Against Adversarial Attacks on Spoofing Countermeasures of ASV

Adversarial Example Detection by Classification for Deep Speech Recognition

Adversarial Privacy Protection on Speech Enhancement

Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples

MultiPAD: A Multivariant Partition-Based Method for Audio Adversarial Examples Detection

PhoneyTalker: an Out-of-the-Box Toolkit for Adversarial Example Attack on Speaker Recognition

VSMask: Defending Against Voice Synthesis Attack via Real-Time Predictive Perturbation

Exploratory Evaluation of Speech Content Masking