A One-class Model for Voice Replay Attack Detection

Xingliang Cheng, Lantian Li, Mingxing Xu, Dong Wang, Thomas Fang Zheng
DOI: https://doi.org/10.1007/978-981-19-5288-3_14
2023-01-01
Abstract: Replay attacks pose a serious security concern for automatic speaker verification systems. Most existing replay detection methods cast the task as a binary classification problem. In this article, by analyzing the distributions of genuine and replayed speech with a specifically designed database and summarizing the known artifacts in existing datasets, we show the potential shortcomings of the two-class approach in both discrimination and generalization, and discuss the advantages of the one-class approach. As a demonstration, we present our recent investigation of a novel one-class replay detection method, which models the discrepancy between the test speech and the enrollment speech with a Gaussian mixture model, thus casting replay detection as an out-of-distribution detection task.
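The one-class idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which features or which discrepancy measure the authors use, so everything here is an assumption made for illustration: the 8-dimensional toy features, the simple difference `test - enroll` as the discrepancy, the simulated channel shift for replayed speech, and the 5th-percentile threshold. The Gaussian mixture model is fitted on genuine trials only, and a trial is flagged as a replay when its log-likelihood under that model is unusually low.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
DIM = 8  # hypothetical feature dimension, not taken from the paper


def simulate_pairs(n, replayed):
    """Return toy (enrollment, test) feature pairs.

    Genuine test features differ from enrollment by small session noise.
    Replayed ones get an extra constant shift standing in for the
    loudspeaker/recorder distortion (an assumption for this sketch).
    """
    enroll = rng.normal(0.0, 1.0, size=(n, DIM))
    test = enroll + rng.normal(0.0, 0.3, size=(n, DIM))
    if replayed:
        test = test + 1.5
    return enroll, test


def discrepancy(enroll, test):
    """Assumed discrepancy measure: plain feature difference."""
    return test - enroll


# One-class training: only genuine trials are used to fit the model.
train_enroll, train_test = simulate_pairs(2000, replayed=False)
train_disc = discrepancy(train_enroll, train_test)
gmm = GaussianMixture(n_components=2, covariance_type="diag", random_state=0)
gmm.fit(train_disc)

# Threshold chosen so that about 5% of genuine training trials fall below it.
threshold = np.percentile(gmm.score_samples(train_disc), 5)


def is_replay(enroll, test):
    """Flag trials whose discrepancy is out of distribution for genuine speech."""
    return gmm.score_samples(discrepancy(enroll, test)) < threshold


genuine_flags = is_replay(*simulate_pairs(500, replayed=False))
replay_flags = is_replay(*simulate_pairs(500, replayed=True))
print("false alarm rate on genuine trials:", genuine_flags.mean())
print("detection rate on replayed trials:", replay_flags.mean())
```

Because no replayed samples are seen during training, the detector does not depend on the artifacts of any particular replay dataset, which is the generalization argument the abstract makes for the one-class approach.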