On Visible Adversarial Perturbations & Digital Watermarking

Jamie Hayes
DOI: https://doi.org/10.1109/cvprw.2018.00210
2018-06-01
Abstract:Given a machine learning model, adversarial perturbations transform images such that the model's output is classified as an attacker chosen class. Most research in this area has focused on adversarial perturbations that are imperceptible to the human eye. However, recent work has considered attacks that are perceptible but localized to a small region of the image. Under this threat model, we discuss both defenses that remove such adversarial perturbations, and attacks that can bypass these defenses.
What problem does this paper attempt to address?