Confidential Computing within an AI Accelerator

Kapil Vaswani; Stavros Volos; Cédric Fournet; Antonio Nino Diaz; Ken Gordon; Balaji Vembu; Sam Webster; David Chisnall; Saurabh Kulkarni; Graham Cunningham; Richard Osborne; Daniel Wilkinson

Confidential Computing within an AI Accelerator

Kapil Vaswani ,
Stavros Volos ,
Cédric Fournet ,
Antonio Nino Diaz ,
Ken Gordon ,
Balaji Vembu ,
Sam Webster ,
David Chisnall ,
Saurabh Kulkarni ,
Graham Cunningham ,
Richard Osborne ,
Daniel Wilkinson

2023 USENIX Annual Technical Conference | July 2023

Download BibTex

We present IPU Trusted Extensions (ITX), a set of hardware extensions that enables trusted execution environments in Graphcore’s AI accelerators. ITX enables the execution of AI workloads with strong confidentiality and integrity guarantees at low performance overheads. ITX isolates workloads from untrusted hosts, and ensures their data and models remain encrypted at all times except within the accelerator’s chip. ITX includes a hardware root-of-trust that provides attestation capabilities and orchestrates trusted execution, and on-chip programmable cryptographic engines for authenticated encryption of code/data at PCIe bandwidth. We also present software for ITX in the form of compiler and runtime extensions that support multi-party training without requiring a CPU-based TEE.

We included experimental support for ITX in Graphcore’s GC200 IPU taped out at TSMC’s 7nm node. Its evaluation on a development board using standard DNN training workloads suggests that ITX adds <5% performance overhead and delivers up to 17x better performance compared to CPU-based confidential computing systems based on AMD SEV-SNP.