Abstract:Recently biometric authentication has made progress in areas, such as speaker verification. However, some evidence shows that the technology is susceptible to malicious spoofing attacks, and thus dedicated countermeasures are needed to detect a variety of specific attack types. Inspired by the great success of deep learning in automatic speech recognition, we propose a detailed deep learning based feature engineering framework for spoofing detection in this paper. To incorporate deep learning into spoofing detection, this work proposes novel approaches for extracting and using features from deep learning models. In contrast to the traditional short-term spectral features, such as MFCC or PLP, outputs from the hidden layer of various deep models are employed as deep features for spoofing detection. Two frameworks are developed to extract deep features, including DNN-based frame-level feature extraction and RNN-based sequence-level feature extraction, and several structures are explored within each framework. Once the deep features are extracted, they can be used as a spoofing identity representation for each utterance, and the appropriate back-end classifier is then applied to make the final detection decision. These approaches were evaluated on the ASVspoof2015 Challenge data corpus. Experiments show that deep feature based systems achieve good performance, even without using any designed features such as phase and cochlea features common in spoofing detection, and obtain significant performance improvements compared to the traditional baselines. The EER of the best deep feature system achieves nearly 0.0% for all attack types from S1 to S9, and gets 1.1% on all averaged conditions (plus S10), which is very promising performance in ASVspoof2015 Challenge task.

Anti-Spoofing Speaker Verification System with Multi-Feature Integration and Multi-Task Learning

Multi-task Learning Based Spoofing-Robust Automatic Speaker Verification System

End-to-end Spoofing Speech Detection and Knowledge Distillation under Noisy Conditions

Siamese Network with Wav2vec Feature for Spoofing Speech Detection

Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection

Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion.

Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches

Spoofing-Aware Speaker Verification by Multi-Level Fusion

Anti-spoofing Methods for Automatic SpeakerVerification System

Spoofing Speaker Verification System by Adversarial Examples Leveraging the Generalized Speaker Difference.

Robust Deep Feature For Spoofing Detection - The Sjtu System For Asvspoof 2015 Challenge

The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge

Attention-Based Convolutional Neural Network for ASV Spoofing Detection.

Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis

Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning

A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification

Deep Features for Automatic Spoofing Detection

Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples

Simultaneous Utilization of Spectral Magnitude and Phase Information to Extract Supervectors for Speaker Verification Anti-Spoofing

Speaker-Aware Anti-Spoofing