Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior

Hu Zhang,Linchao Zhu,Yi Zhu,Yi Yang
DOI: https://doi.org/10.48550/arXiv.2003.07637
2020-03-17
Computer Vision and Pattern Recognition
Abstract:Deep neural networks are known to be susceptible to adversarial noise, which are tiny and imperceptible perturbations. Most of previous work on adversarial attack mainly focus on image models, while the vulnerability of video models is less explored. In this paper, we aim to attack video models by utilizing intrinsic movement pattern and regional relative motion among video frames. We propose an effective motion-excited sampler to obtain motion-aware noise prior, which we term as sparked prior. Our sparked prior underlines frame correlations and utilizes video dynamics via relative motion. By using the sparked prior in gradient estimation, we can successfully attack a variety of video classification models with fewer number of queries. Extensive experimental results on four benchmark datasets validate the efficacy of our proposed method.
What problem does this paper attempt to address?