LLM Agents can Autonomously Hack Websites

Richard Fang,Rohan Bindu,Akul Gupta,Qiusi Zhan,Daniel Kang
2024-02-16
Abstract:In recent years, large language models (LLMs) have become increasingly capable and can now interact with tools (i.e., call functions), read documents, and recursively call themselves. As a result, these LLMs can now function autonomously as agents. With the rise in capabilities of these agents, recent work has speculated on how LLM agents would affect cybersecurity. However, not much is known about the offensive capabilities of LLM agents.
Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to explore whether large language model (LLM) agents can autonomously attack websites, especially perform complex tasks such as blind database schema extraction and SQL injection without prior knowledge of vulnerabilities or human feedback. The paper demonstrates the capabilities of these agents through experiments and analyzes the performance differences of different models in this task, especially the capability gap between the state - of - the - art model GPT - 4 and open - source models. In addition, the paper also discusses the network security risks that this technology may bring and the related cost - benefit analysis.