Abstract: Deep learning unlocks applications with societal impacts, e.g., detecting child exploitation imagery and genomic analysis of rare diseases. Deployment, however, requires compliance with stringent privacy regulations, so training algorithms that preserve the privacy of training data are urgently needed. Purely cryptographic approaches can protect privacy, but they remain costly, even when they rely on two or more non-colluding servers. Seemingly "trivial" plaintext operations quickly become prohibitively inefficient when a series of them is "crypto-processed," e.g., (dynamic) quantization to ensure that intermediate values do not overflow. Slalom, recently proposed by Tramèr and Boneh, is the first solution that leverages both a GPU (for efficient batch computation) and a trusted execution environment (TEE) (for minimizing the use of cryptography). Roughly, it relies on extensive pre-computation over known and fixed weights, and hence it only supports private inference. Five related problems for private training are left unaddressed. Goten, our privacy-preserving training and prediction framework, tackles all five problems simultaneously via our careful design over the "mismatched" cryptographic and GPU data types (due to the tension between precision and efficiency) and our round-optimal GPU-outsourcing protocol (hence minimizing the communication cost between servers). It 1) stochastically trains a low-bitwidth yet accurate model, 2) supports dynamic quantization (a challenge left by Slalom), 3) minimizes the memory-swapping overhead of the memory-limited TEE and its communication with the GPU, 4) crypto-protects the (dynamic) model weights from the untrusted GPU, and 5) outperforms a pure-TEE system, even without the pre-computation needed by Slalom. As a baseline, we build CaffeScone, which secures Caffe using a TEE but no GPU; Goten achieves a 6.84x speed-up over it on the whole VGG-11. Goten also outperforms Falcon, proposed by Wagh et al. and the latest secure multi-server cryptographic solution, by 132.64x using VGG-11. Lastly, we demonstrate Goten's efficacy in training models for breast cancer diagnosis over sensitive images.
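
To make the terms "dynamic quantization" and "stochastically trains a low-bitwidth model" more concrete, the following is a minimal generic sketch of per-tensor dynamic quantization with stochastic rounding. It is not Goten's actual scheme: the function names (`stochastic_round`, `dynamic_quantize`, `dequantize`), the 8-bit default, and the max-magnitude scaling rule are all assumptions chosen for illustration.

```python
import math
import random


def stochastic_round(x, rng):
    """Round x down or up at random so that the expected result equals x."""
    lower = math.floor(x)
    # Round up with probability equal to the fractional part of x.
    return lower + (1 if rng.random() < (x - lower) else 0)


def dynamic_quantize(values, bits=8, rng=None):
    """Quantize floats to signed `bits`-bit integers (hypothetical helper).

    The scale is recomputed from the current maximum magnitude on every
    call ("dynamic"), so the quantized values always fit the target range.
    """
    rng = rng or random.Random(0)
    qmax = 2 ** (bits - 1) - 1
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [stochastic_round(v / scale, rng) for v in values]
    # Clamp to guard against floating-point edge cases at the boundary.
    quantized = [max(-qmax, min(qmax, n)) for n in quantized]
    return quantized, scale


def dequantize(quantized, scale):
    """Map quantized integers back to approximate floats."""
    return [n * scale for n in quantized]


if __name__ == "__main__":
    activations = [0.5, -1.0, 0.25, 1000.0]
    q, scale = dynamic_quantize(activations, bits=8, rng=random.Random(1))
    print(q, scale)
    print(dequantize(q, scale))
```

Because the scale depends on the data seen at run time, it cannot be fixed ahead of time, which is one reason such an operation is cheap in plaintext but awkward to carry out under cryptographic protection.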

Occlumency

GELU-Net: A Globally Encrypted, Locally Unencrypted Deep Neural Network for Privacy-Preserved Learning

CHEETAH: An Ultra-Fast, Approximation-Free, and Privacy-Preserved Neural Network Framework based on Joint Obscure Linear and Nonlinear Computations

A Distributed Privacy-Preserving Framework for Deep Learning with Edge-Cloud Computing

Model Protection: Real-Time Privacy-Preserving Inference Service for Model Privacy at the Edge

Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training

Protecting In-memory Data Cache with Secure Enclaves in Untrusted Cloud

Lightweight and Unobtrusive Privacy Preservation for Remote Inference Via Edge Data Obfuscation

SecoInfer: Secure DNN End-Edge Collaborative Inference Framework Optimizing Privacy and Latency

Efficient and Secure Deep Learning Inference in Trusted Processor Enabled Edge Clouds

No Privacy Left Outside: on the (In-)Security of TEE-Shielded DNN Partition for On-Device ML

ShadowNet: A Secure and Efficient On-device Model Inference System for Convolutional Neural Networks

Learning to Prevent Input Leakages in the Mobile Cloud Inference

Lightweight and Unobtrusive Data Obfuscation at IoT Edge for Remote Inference

All Rivers Run to the Sea: Private Learning with Asymmetric Flows

Memory-Efficient and Secure DNN Inference on TrustZone-enabled Consumer IoT Devices

Privacy preserving layer partitioning for Deep Neural Network models

Goten: GPU-Outsourcing Trusted Execution of Neural Network Training

Slalom: Fast, Verifiable and Private Execution of Neural Networks in Trusted Hardware

Secure and Verifiable Inference in Deep Neural Networks

Edge-Enabled Distributed Deep Learning for 5G Privacy Protection