Privacy-Preserving Large Language Models (PPLLMs)

Mohammad Raeini
DOI: https://doi.org/10.2139/ssrn.4512071
2023-01-01
SSRN Electronic Journal
Abstract: Large language models (LLMs) have recently gained significant attention because they have shown surprising signs of artificial general intelligence (AGI). Artificial intelligence and large language models can serve many beneficial purposes, such as digital assistants for knowledge creation. However, such powerful models also carry risks, among them the security and privacy risks that AI models pose to data and to users. In this article, we discuss how mathematical structures, such as polynomial and vector spaces, and privacy-preserving delegation of polynomial and matrix-vector functions can be used to transform a computational model (including an LLM) into a privacy-preserving computational model. Furthermore, we highlight some well-known cryptographic constructions, along with solutions by which LLMs can be improved so that they preserve the privacy and security of data and thus of users. Overall, the privacy-preserving and zero-knowledge LLMs that we introduce in this article could be potential solutions for preserving the privacy of data and users to a reasonable extent. More importantly, perhaps AI models should be trained on publicly available, trustworthy data, and the trained models should be compressed and run locally by users.