Security and Privacy Challenges of Large Language Models: A Survey

Badhan Chandra Das,M. Hadi Amini,Yanzhao Wu
2024-01-30
Abstract:Large Language Models (LLMs) have demonstrated extraordinary capabilities and contributed to multiple fields, such as generating and summarizing text, language translation, and question-answering. Nowadays, LLM is becoming a very popular tool in computerized language processing tasks, with the capability to analyze complicated linguistic patterns and provide relevant and appropriate responses depending on the context. While offering significant advantages, these models are also vulnerable to security and privacy attacks, such as jailbreaking attacks, data poisoning attacks, and Personally Identifiable Information (PII) leakage attacks. This survey provides a thorough review of the security and privacy challenges of LLMs for both training data and users, along with the application-based risks in various domains, such as transportation, education, and healthcare. We assess the extent of LLM vulnerabilities, investigate emerging security and privacy attacks for LLMs, and review the potential defense mechanisms. Additionally, the survey outlines existing research gaps in this domain and highlights future research directions.
Computation and Language,Artificial Intelligence,Cryptography and Security
What problem does this paper attempt to address?
The main focus of this paper is on the challenges faced by Large Language Models (LLMs) in terms of security and privacy. With the widespread use of LLMs in areas such as text generation, summarization, translation, and question answering, they have also become vulnerable to security threats such as jailbreaking attacks, data poisoning attacks, and Personally Identifiable Information (PII) leakage. The paper comprehensively reviews the security and privacy issues of LLMs in terms of training data and users, as well as the application risks in different domains (such as transportation, education, and healthcare). It evaluates the vulnerabilities of LLMs and explores emerging security and privacy attacks, as well as potential defense mechanisms. Additionally, the paper highlights the gaps in current research and future research directions. The aim of the paper is to provide researchers, practitioners, and other stakeholders with a clear understanding of the challenges in LLM security and privacy, in order to design new evaluation protocols and attack defense strategies that meet the evolving needs of LLMs.