LLM in the Shell: Generative Honeypots

Muris Sladić,Veronica Valeros,Carlos Catania,Sebastian Garcia

DOI: https://doi.org/10.1109/EuroSPW61312.2024.00054

2024-09-23

Abstract:Honeypots are essential tools in cybersecurity for early detection, threat intelligence gathering, and analysis of attacker's behavior. However, most of them lack the required realism to engage and fool human attackers long-term. Being easy to distinguish honeypots strongly hinders their effectiveness. This can happen because they are too deterministic, lack adaptability, or lack deepness. This work introduces shelLM, a dynamic and realistic software honeypot based on Large Language Models that generates Linux-like shell output. We designed and implemented shelLM using cloud-based LLMs. We evaluated if shelLM can generate output as expected from a real Linux shell. The evaluation was done by asking cybersecurity researchers to use the honeypot and give feedback if each answer from the honeypot was the expected one from a Linux shell. Results indicate that shelLM can create credible and dynamic answers capable of addressing the limitations of current honeypots. ShelLM reached a TNR of 0.90, convincing humans it was consistent with a real Linux shell. The source code and prompts for replicating the experiments have been publicly available.

Cryptography and Security,Artificial Intelligence,Computation and Language

What problem does this paper attempt to address?

The paper attempts to address the issue that current honeypot systems lack sufficient realism and dynamism when simulating real systems (such as the Linux shell), making them easily identifiable by human attackers. Specifically, existing shell-based honeypot systems have the following problems: 1. **Limited behavior**: The possible operations are limited. 2. **Overly generic file system**: The structure and content of the file system are not complex or realistic enough. 3. **Hardcoded responses**: The provided answers are too fixed and lack flexibility. 4. **Overall system not complex enough**: The entire system does not appear realistic enough. To solve these problems, the researchers proposed a new system called shelLM, which is a dynamic honeypot software based on a large language model (LLM). shelLM aims to generate credible and realistic Linux shell outputs, thereby confusing human attackers for a longer period and delaying their realization that they are not interacting with a real Linux shell. By using LLM technology, shelLM can create all information from the file system structure to on-demand file content, thereby improving the effectiveness and realism of the honeypot system.

LLM in the Shell: Generative Honeypots

LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems

Don't Stop Believin': A Unified Evaluation Approach for LLM Honeypots

LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild

LLMPot: Automated LLM-based Industrial Protocol and Physical Process Emulation for ICS Honeypots

HoneyGPT: Breaking the Trilemma in Terminal Honeypots with Large Language Model

Security Orchestration, Automation, and Response Engine for Deployment of Behavioural Honeypots

HoneyModels: Machine Learning Honeypots

RatGPT: Turning online LLMs into Proxies for Malware Attacks

Honeypot Implementation in a Cloud Environment

Chatbots in a Honeypot World

Act as a Honeytoken Generator! An Investigation into Honeytoken Generation with Large Language Models

HoneyGAN Pots: A Deep Learning Approach for Generating Honeypots

Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks

A Generative Model Based Honeypot for Industrial OPC UA Communication

HoneyFactory: Container-Based Comprehensive Cyber Deception Honeynet Architecture

Large Language Models are Few-shot Generators: Proposing Hybrid Prompt Algorithm To Generate Webshell Escape Samples

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

Constructing Dynamic Honeypot Using Machine Learning

Lessons Learned from the deployment of a high-interaction honeypot

Adaptive Honeypot Engagement through Reinforcement Learning of Semi-Markov Decision Processes