Poster Abstract: CNN-guardian: Secure Neural Network Inference Acceleration on Edge GPU.

Qipeng Xie,Hao Yang,Linshan Jiang,Zhihe Zhao,Siyang Jiang,Shiyu Shen,Salabat Khan,Zhe Liu,Kaishun Wu
DOI: https://doi.org/10.1145/3625687.3628394
2023-01-01
Abstract:The rapid development of AI applications powered by deep learning in edge devices boosts the opportunity for real-time health monitoring. To address the potential privacy concern in the inference phase, homomorphic encryption (HE) is an alternative solution that encrypts inference data without exposing raw data and has several distinct advantages, (i.e., single-round communication, lightweight bandwidth consumption, and non-interactive computation). However, the computational overhead on the current HE-based privacy-preserving inference necessitates a substantial amount of time, which is not feasible for some real-time applications on edge devices. To address this issue, we propose CNN-guardian, a unified and compact neural network structure for real-time inference in HE-based inference on edge GPU. CNN-guardian designs a HE-friendly neural network and GPU engine that optimizes HE operations to accelerate the inference in the HE domain.
What problem does this paper attempt to address?