Abstract: Software vulnerabilities, caused by unintentional flaws in source code, are a primary root cause of cyberattacks. Static analysis of source code has been widely used to detect these unintentional defects introduced by software developers. Large Language Models (LLMs) have demonstrated human-like conversational abilities due to their capacity to capture complex patterns in sequential data, such as natural languages. In this paper, we harness LLMs' capabilities to analyze source code and detect known vulnerabilities. To ensure the proposed vulnerability detection method is universal across multiple programming languages, we convert source code to LLVM IR and train LLMs on these intermediate representations. We conduct extensive experiments on various LLM architectures and compare their accuracy. Our comprehensive experiments on real-world and synthetic code from NVD and SARD demonstrate high accuracy in identifying source code vulnerabilities.

What problem does this paper attempt to address?

This paper addresses vulnerability detection in software source code. Vulnerabilities caused by unintentional defects in source code are a primary root cause of cyberattacks and can lead to serious social and economic losses. Traditional static analysis is widely used to detect these defects, but it has limitations: it often cannot pinpoint the specific vulnerable lines, and it does not generalize well across programming languages. To overcome these limitations, the paper proposes using large language models (LLMs) to analyze source code and detect known vulnerabilities. The source code is converted to LLVM intermediate representation (IR), and LLMs are trained on that representation, with the aim of a single detection method that works across multiple programming languages. The reported experiments show high detection accuracy on real-world and synthetic code datasets.

The key steps are:

1. **Source code conversion**: Convert source code written in different programming languages into LLVM IR, so that the method is language-independent.
2. **Feature extraction**: Extract syntactic and semantic features from the LLVM IR to generate intermediate representations (iSeVCs).
3. **Model training**: Use a custom tokenizer to map the intermediate representations to unique identifiers, and train LLMs on them for vulnerability detection.
4. **Performance evaluation**: Compare the proposed method against existing methods, such as VulDeeLocator and LSTM-based methods.

The goal of the research is to use the capabilities of LLMs to improve both the accuracy and the universality of source code vulnerability detection.
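To make the tokenization step more concrete, here is a minimal Python sketch of how LLVM IR text could be split into tokens and mapped to integer identifiers. It is an illustration under stated assumptions, not the paper's tokenizer: the regular expression, the `<pad>` and `<unk>` special tokens, the function names, and the sample IR are all hypothetical. For C sources, IR of this kind can be produced with `clang -S -emit-llvm`.

```python
import re

# Hypothetical LLVM IR sample; in practice it would come from a compiler
# front end (for C, e.g. `clang -S -emit-llvm file.c -o file.ll`).
SAMPLE_IR = """
define i32 @copy(i8* %dst, i8* %src) {
entry:
  %call = call i8* @strcpy(i8* %dst, i8* %src)
  ret i32 0
}
"""

# One token per @global or %local name, keyword or type, number,
# or single punctuation character. This pattern is an assumption.
TOKEN_RE = re.compile(r"[@%][\w.]+|[A-Za-z_][\w.]*|\d+|[^\s\w]")

PAD, UNK = "<pad>", "<unk>"  # assumed special tokens


def tokenize(ir_text):
    """Split LLVM IR text into a flat list of string tokens."""
    return TOKEN_RE.findall(ir_text)


def build_vocab(corpus):
    """Assign each distinct token a unique integer id, in first-seen order."""
    vocab = {PAD: 0, UNK: 1}
    for text in corpus:
        for tok in tokenize(text):
            vocab.setdefault(tok, len(vocab))
    return vocab


def encode(ir_text, vocab, max_len=32):
    """Map tokens to ids, then truncate or pad to a fixed length."""
    ids = [vocab.get(tok, vocab[UNK]) for tok in tokenize(ir_text)][:max_len]
    return ids + [vocab[PAD]] * (max_len - len(ids))


if __name__ == "__main__":
    vocab = build_vocab([SAMPLE_IR])
    print(tokenize("ret i32 0"))
    print(encode("ret i32 0", vocab, max_len=5))
```

Fixed-length id sequences like these are the usual input format for sequence models. A real pipeline would likely also normalize user-defined names (for example, renaming variables to generic placeholders) so the vocabulary stays small; that step is omitted here.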

Harnessing the Power of LLMs in Source Code Vulnerability Detection

Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study

Understanding the Effectiveness of Large Language Models in Detecting Security Vulnerabilities

Outside the Comfort Zone: Analysing LLM Capabilities in Software Vulnerability Detection

Software Vulnerability and Functionality Assessment using LLMs

LLbezpeky: Leveraging Large Language Models for Vulnerability Detection

Towards Effectively Detecting and Explaining Vulnerabilities Using Large Language Models

VulnLLMEval: A Framework for Evaluating Large Language Models in Software Vulnerability Detection and Patching

Attention Is All You Need for LLM-based Code Vulnerability Localization

Multitask-based Evaluation of Open-Source LLM on Software Vulnerability

Large Language Models for Secure Code Assessment: A Multi-Language Empirical Study

How Far Have We Gone in Vulnerability Detection Using Large Language Models

LLMs Cannot Reliably Identify and Reason About Security Vulnerabilities (Yet?): A Comprehensive Evaluation, Framework, and Benchmarks

Enhanced Automated Code Vulnerability Repair using Large Language Models

LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning

Smart Contract Vulnerability Detection: The Role of Large Language Model (LLM)

VulDetectBench: Evaluating the Deep Capability of Vulnerability Detection with Large Language Models

Can LLMs be Fooled? Investigating Vulnerabilities in LLMs

An Empirical Study of Automated Vulnerability Localization with Large Language Models

RealVul: Can We Detect Vulnerabilities in Web Applications with LLM?