AdvSV: An Over-the-Air Adversarial Attack Dataset for Speaker Verification

Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, Zhizheng Wu
2024-01-16
Abstract: It is known that deep neural networks are vulnerable to adversarial attacks. Although Automatic Speaker Verification (ASV) built on top of deep neural networks exhibits robust performance in controlled scenarios, many studies confirm that ASV is vulnerable to adversarial attacks. The lack of a standard dataset is a bottleneck for further research, especially reproducible research. In this study, we developed an open-source adversarial attack dataset for speaker verification research. As an initial step, we focused on the over-the-air attack. An over-the-air adversarial attack involves a perturbation generation algorithm, a loudspeaker, a microphone, and an acoustic environment. The variations in the recording configurations make it very challenging to reproduce previous research. The AdvSV dataset is constructed using the Voxceleb1 Verification test set as its foundation. The dataset attacks representative ASV models with adversarial samples and records those samples to simulate over-the-air attack settings. The scope of the dataset can be easily extended to include more types of adversarial attacks. The dataset will be released to the public under the CC BY-SA 4.0 license. We also provide a detection baseline for reproducible research.
Sound, Audio and Speech Processing
What problem does this paper attempt to address?
This paper addresses the security of Automatic Speaker Verification (ASV) systems under adversarial attack. ASV systems based on deep neural networks perform well in controlled environments, but many studies have shown that they are vulnerable to adversarial attacks. The paper focuses on over-the-air attacks, in which pre-generated adversarial samples are played through a loudspeaker and re-recorded with a microphone. Because the outcome depends on the loudspeaker, the microphone, and the acoustic environment, differences in recording configuration make earlier results hard to reproduce and compare. The lack of a standard adversarial attack dataset has therefore become a bottleneck for further research, especially reproducible research. To address this, the paper develops AdvSV, an open-source adversarial attack dataset focused on over-the-air attacks. The dataset is built on the Voxceleb1 verification test set: representative ASV models are attacked, and the resulting adversarial samples are recorded to simulate over-the-air attack settings. With a shared dataset, researchers can evaluate the security of different ASV systems on common data and compare attacks and defenses more directly. The paper also provides a baseline for adversarial attack detection to support reproducible research.
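To make the over-the-air channel described above concrete, the following is a minimal sketch that approximates it digitally: reverberation is modelled by convolving the waveform with a room impulse response, and microphone or ambient noise is modelled as additive Gaussian noise at a chosen signal-to-noise ratio. This is not the AdvSV recording procedure, which according to the abstract records adversarial samples rather than simulating them. The function names (`synthetic_rir`, `simulate_over_the_air`), the exponentially decaying impulse response, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np


def synthetic_rir(sample_rate=16000, rt60=0.3, length_s=0.25, seed=0):
    """Toy room impulse response (hypothetical): exponentially decaying noise."""
    rng = np.random.default_rng(seed)
    n = int(sample_rate * length_s)
    t = np.arange(n) / sample_rate
    # Amplitude envelope that falls by 60 dB after rt60 seconds.
    decay = 10.0 ** (-3.0 * t / rt60)
    rir = rng.standard_normal(n) * decay
    rir[0] = 1.0  # direct path from loudspeaker to microphone
    return rir / np.linalg.norm(rir)


def simulate_over_the_air(waveform, rir, snr_db=20.0, seed=0):
    """Approximate playback and re-recording: reverberation plus additive noise."""
    rng = np.random.default_rng(seed)
    # Reverberation: convolve with the impulse response, trim to the input length.
    reverberant = np.convolve(waveform, rir)[: len(waveform)]
    # Additive Gaussian noise scaled to the requested signal-to-noise ratio.
    signal_power = np.mean(reverberant ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.standard_normal(len(waveform)) * np.sqrt(noise_power)
    return reverberant + noise


# Usage: a one-second 440 Hz tone stands in for an adversarial utterance.
sample_rate = 16000
clean = np.sin(2 * np.pi * 440 * np.arange(sample_rate) / sample_rate)
recorded = simulate_over_the_air(clean, synthetic_rir(sample_rate))
```

A simulation like this only approximates what a physical loudspeaker, microphone, and room do to a signal, which is one reason a dataset of actual recordings is useful for reproducible comparison.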