Confidential Machine Learning within Graphcore IPUs

Kapil Vaswani, Stavros Volos, Cédric Fournet, Antonio Nino Diaz, Ken Gordon, Balaji Vembu, Sam Webster, David Chisnall, Saurabh Kulkarni, Graham Cunningham, Richard Osborne, Dan Wilkinson
DOI: https://doi.org/10.48550/arXiv.2205.09005
2022-05-20
Abstract: We present IPU Trusted Extensions (ITX), a set of experimental hardware extensions that enable trusted execution environments in Graphcore's AI accelerators. ITX enables the execution of AI workloads with strong confidentiality and integrity guarantees at low performance overheads. ITX isolates workloads from untrusted hosts, and ensures their data and models remain encrypted at all times except within the IPU. ITX includes a hardware root-of-trust that provides attestation capabilities and orchestrates trusted execution, and on-chip programmable cryptographic engines for authenticated encryption of code and data at PCIe bandwidth. We also present software for ITX in the form of compiler and runtime extensions that support multi-party training without requiring a CPU-based TEE. Experimental support for ITX is included in Graphcore's GC200 IPU taped out at TSMC's 7nm technology node. Its evaluation on a development board using standard DNN training workloads suggests that ITX adds less than 5% performance overhead, and delivers up to 17x better performance compared to CPU-based confidential computing systems relying on AMD SEV-SNP.
Cryptography and Security, Artificial Intelligence, Hardware Architecture