LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins

Umar Iqbal,Tadayoshi Kohno,Franziska Roesner
2024-07-27
Abstract:Large language model (LLM) platforms, such as ChatGPT, have recently begun offering an app ecosystem to interface with third-party services on the internet. While these apps extend the capabilities of LLM platforms, they are developed by arbitrary third parties and thus cannot be implicitly trusted. Apps also interface with LLM platforms and users using natural language, which can have imprecise interpretations. In this paper, we propose a framework that lays a foundation for LLM platform designers to analyze and improve the security, privacy, and safety of current and future third-party integrated LLM platforms. Our framework is a formulation of an attack taxonomy that is developed by iteratively exploring how LLM platform stakeholders could leverage their capabilities and responsibilities to mount attacks against each other. As part of our iterative process, we apply our framework in the context of OpenAI's plugin (apps) ecosystem. We uncover plugins that concretely demonstrate the potential for the types of issues that we outline in our attack taxonomy. We conclude by discussing novel challenges and by providing recommendations to improve the security, privacy, and safety of present and future LLM-based computing platforms.
Cryptography and Security,Artificial Intelligence,Computation and Language,Computers and Society,Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the security, privacy, and safety issues faced by large language model (LLM) platforms (such as ChatGPT) when supporting third-party plugins. Specifically, the paper proposes a systematic evaluation framework to help LLM platform designers analyze and improve the security, privacy, and safety of current and future third-party integrated LLM platforms. #### Main Issues Include: 1. **Trust Issues with Third-Party Plugins**: - Third-party plugins are developed by unfamiliar developers and cannot be implicitly trusted. - Plugins may exploit natural language interfaces for ambiguous or imprecise interpretations, leading to security issues. 2. **Existing Security and Privacy Issues**: - Previous research indicates that third-party integrations on other computing platforms often lead to security and privacy issues. - For example, plugins may initiate prompt injection attacks or even take over the LLM platform. 3. **Limitations and Insufficient Review by Platform Vendors**: - Currently, some LLM platform vendors (such as OpenAI) impose few restrictions on third-party plugins, and the review process is relatively weak. - This may result in plugins going online without thorough review, increasing risks for users and the platform. 4. **Future Challenges**: - As LLM platform functionalities expand and third-party integrations increase, new security and privacy issues may arise in the future. - The paper proposes a systematic evaluation framework to provide a foundation for future research and practice to address these challenges. ### Main Contributions of the Framework 1. **Systematic Evaluation Framework**: - Developed a systematic evaluation framework centered around an attack taxonomy to assess the security, privacy, and safety of LLM platforms. 2. **Practical Application**: - Applied the framework in OpenAI's plugin ecosystem, identifying multiple plugins with potential attack risks. 3. **Guidance for Future Research and Practice**: - Reflected on the framework and discovered attacks, providing challenges and lessons for future researchers and industry practitioners to better protect the security, privacy, and safety of LLM platforms. ### Summary Through a systematic approach, the paper reveals various security, privacy, and safety issues faced by LLM platforms when supporting third-party plugins and proposes an evaluation framework to help designers and researchers address these challenges. This not only aids current LLM platforms but also provides important references for future LLM ecosystems.