Hot PATE: Private Aggregation of Distributions for Diverse Tasks

Edith Cohen, Benjamin Cohen-Wang, Xin Lyu, Jelani Nelson, Tamas Sarlos, Uri Stemmer
2024-05-18
Abstract: The Private Aggregation of Teacher Ensembles (PATE) framework is a versatile approach to privacy-preserving machine learning. In PATE, teacher models that are not privacy-preserving are trained on distinct portions of sensitive data. Privacy-preserving knowledge transfer to a student model is then facilitated by privately aggregating teachers' predictions on new examples. Employing PATE with generative auto-regressive models presents both challenges and opportunities. These models excel in open-ended diverse (aka hot) tasks with multiple valid responses. Moreover, the knowledge of models is often encapsulated in the response distribution itself, and preserving this diversity is critical for fluid and effective knowledge transfer from teachers to student. In all prior designs, higher diversity resulted in lower teacher agreement and thus a tradeoff between diversity and privacy. Prior works with PATE thus focused on non-diverse settings or limiting diversity to improve utility.
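To make the aggregation step and the diversity-privacy tradeoff concrete, here is a minimal sketch of a classic PATE-style noisy-argmax vote. It is not the Hot PATE aggregation proposed in the paper. The function names, the toy vote lists, and the noise scale are all assumptions chosen for illustration, and the noise is not calibrated to any particular privacy budget.

```python
import random
from collections import Counter


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is Laplace-distributed with location 0 and scale `scale`.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_argmax(teacher_votes, num_labels, noise_scale, rng):
    """Return the label with the highest noise-perturbed vote count.

    teacher_votes: one predicted label (int) per teacher.
    noise_scale:   illustrative Laplace scale (assumption, not tuned
                   to a specific privacy guarantee).
    """
    counts = Counter(teacher_votes)
    noisy_counts = [
        counts[label] + laplace_noise(noise_scale, rng)
        for label in range(num_labels)
    ]
    return max(range(num_labels), key=noisy_counts.__getitem__)


# Hypothetical toy inputs: 100 teachers, 10 possible labels.
high_agreement = [2] * 90 + [0] * 5 + [1] * 5   # most teachers agree on label 2
diverse = [i % 10 for i in range(100)]          # votes spread evenly over all labels

rng = random.Random(0)
agreed_label = noisy_argmax(high_agreement, 10, 1.0, rng)
diverse_label = noisy_argmax(diverse, 10, 1.0, rng)
```

With high agreement, the winning margin is far larger than the noise, so the output is stable. With evenly spread votes, every label has a similar count and the noise alone decides the winner, which is the loss of utility that the abstract attributes to diverse tasks in prior designs.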
Machine Learning, Artificial Intelligence, Cryptography and Security, Data Structures and Algorithms