Adversarial Attacks, Regression, and Numerical Stability Regularization

Andre T. Nguyen, Edward Raff
DOI: https://doi.org/10.48550/arXiv.1812.02885
IF: 5.414
2018-12-07
Machine Learning
Abstract: Adversarial attacks against neural networks in a regression setting are a critical yet understudied problem. In this work, we advance the state of the art by investigating adversarial attacks against regression networks and by formulating a more effective defense against these attacks. In particular, we take the perspective that adversarial attacks are likely caused by numerical instability in learned functions. We introduce a stability-inducing, regularization-based defense against adversarial attacks in the regression setting. Our new and easy-to-implement defense is shown to outperform prior approaches and to improve the numerical stability of learned functions.
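The abstract's premise, that adversarial vulnerability in regression is tied to numerical instability of the learned function, can be illustrated with a toy example. The sketch below is not the paper's defense or attack. It assumes a hypothetical linear regressor, a one-step gradient-sign perturbation against squared error, and an illustrative `stability_penalty` that measures how much the output moves under small random input perturbations. All names, weights, and the value of `eps` are invented for illustration.

```python
import random


def predict(w, b, x):
    """Linear regression model: f(x) = w . x + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def gradient_sign_attack(w, b, x, y, eps):
    """One-step gradient-sign perturbation that increases squared error.

    For the loss (f(x) - y)^2, d loss / d x_i = 2 * (f(x) - y) * w_i,
    so each input coordinate is moved by eps in the direction of that sign.
    """
    residual = predict(w, b, x) - y
    return [xi + eps * sign(2.0 * residual * wi) for xi, wi in zip(x, w)]


def stability_penalty(w, b, x, eps, n_samples=32, seed=0):
    """Mean squared output change under random perturbations in an eps-box.

    Illustrative stand-in for a stability regularizer (an assumption),
    not the formulation used in the paper.
    """
    rng = random.Random(seed)
    base = predict(w, b, x)
    total = 0.0
    for _ in range(n_samples):
        x_noisy = [xi + rng.uniform(-eps, eps) for xi in x]
        total += (predict(w, b, x_noisy) - base) ** 2
    return total / n_samples


# Hypothetical models: same direction, different weight magnitudes.
w_large = [8.0, -6.0]
w_small = [0.8, -0.6]
b = 0.1
x, y = [1.0, 2.0], 0.0
eps = 0.05

for name, w in [("large weights", w_large), ("small weights", w_small)]:
    x_adv = gradient_sign_attack(w, b, x, y, eps)
    clean_err = (predict(w, b, x) - y) ** 2
    adv_err = (predict(w, b, x_adv) - y) ** 2
    print(name, clean_err, adv_err, stability_penalty(w, b, x, eps))
```

In this toy setting the model with larger weights is both more sensitive to random input noise and loses more accuracy under the same perturbation budget. That is the intuition behind adding a stability term to the training loss, whatever exact form the regularizer takes.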