Improved Lightcnn with Attention Modules for Asv Spoofing Detection

Xinyue Ma,Tianyu Liang,Shanshan Zhang,Shen Huang,Liang He
DOI: https://doi.org/10.1109/ICME51207.2021.9428313
2021-01-01
Abstract:With the advent of many state-of-the-art speech synthesis or conversion techniques, speaker recognition system is confronted with the imperceptible interference caused by spoofed speech. In this paper, we realize that filter bank distributions of cepstral features in the frequency domain cause considerable influence to the system performance through experiments. Therefore, given attention mechanism may help to focus on key information, this paper presents improved light convolutional neural network (LCNN) with attention modules, separately named Squeeze-and-Excitation (SE) block, Convolutional Block Attention Module (CBAM) and Dual Attention Network (DANet). To our knowledge, we are the first to do systematic study of attention mechanisms in the field of speech anti-spoofing. Experimented on ASVspoof 2019 dataset, our proposed single system can get 22% min-tDCF reduction as well as 42% EER reduction over LCNN baseline and its performance can even rank fourth among all participating teams using fusion systems in ASVspoof 2019 competition.
What problem does this paper attempt to address?