Adversarial Attack and Interpretability of the Deep Neural Network from the Geometric Perspective

Mengfei XIA, Zipeng YE, Wang ZHAO, Ran YI, Yongjin LIU
DOI: https://doi.org/10.1360/ssi-2020-0169
2021-01-01
Abstract: Deep learning has achieved significant success in various engineering fields. However, its drawbacks have also received considerable attention recently: it suffers from poor interpretability, weak robustness, and difficulty in network training, which seriously affect the security and usability of deep neural networks. Adversarial attacks and interpretability have therefore become focuses of next-generation artificial intelligence research. In this paper, we survey recent work on both topics from a novel geometric perspective. We reformulate the problems in traditional deep learning models from the viewpoint of manifold theory and summarize several interpretability-based strategies for optimizing deep networks. Finally, we state several challenges for interpretability from the viewpoint of manifold theory and outline possible future directions.