Adversarial attacks on neural networks through canonical Riemannian foliations

Eliot Tron,Nicolas Couëllan,Stéphane Puechmorel
DOI: https://doi.org/10.1007/s10994-024-06624-w
IF: 5.414
2024-10-28
Machine Learning
Abstract:Deep learning models are known to be vulnerable to adversarial attacks. Adversarial learning is therefore becoming a crucial task. We propose a new vision on neural network robustness using Riemannian geometry and foliation theory. The idea is illustrated by creating a new adversarial attack that takes into account the curvature of the data space. This new adversarial attack, called the two-step spectral attack , is a piece-wise linear approximation of a geodesic in the data space. The data space is treated as a (degenerate) Riemannian manifold equipped with the pullback of the Fisher Information Metric (FIM) of the neural network. In most cases, this metric is only semi-definite and its kernel becomes a central object to study. A canonical foliation is derived from this kernel. The curvature of transverse leaves gives the appropriate correction to get a two-step approximation of the geodesic and hence a new efficient adversarial attack. The method is first illustrated on a 2D toy example in order to visualize the neural network foliation and the corresponding attacks. Next, we report numerical results on the MNIST and CIFAR10 datasets with the proposed technique and state of the art attacks presented by Zhao et al. (in: Proceedings of the AAAI conference on artificial intelligence, vol 33. pp 5869–5876, 2019) (OSSA) and Croce and Hein (in: III HD, Singh A (eds) Proceedings of machine learning research, vol 119, PMLR, Cambridge, pp 2206–2216, https://proceedings.mlr.press/v119/croce20b.html, 2020) (AutoAttack). The results show that the proposed attack is more efficient at all levels of available budget for the attack (norm of the attack), confirming that the curvature of the transverse neural network FIM foliation plays an important role in the robustness of neural networks. The main objective and interest of this study is to provide a mathematical understanding of the geometrical issues at play in the data space when constructing efficient attacks on neural networks.
computer science, artificial intelligence
What problem does this paper attempt to address?