Exploring the Deceptive Power of LLM-Generated Fake News: A Study of Real-World Detection Challenges

Yanshen Sun,Jianfeng He,Limeng Cui,Shuo Lei,Chang-Tien Lu
2024-04-09
Abstract:Recent advancements in Large Language Models (LLMs) have enabled the creation of fake news, particularly in complex fields like healthcare. Studies highlight the gap in the deceptive power of LLM-generated fake news with and without human assistance, yet the potential of prompting techniques has not been fully explored. Thus, this work aims to determine whether prompting strategies can effectively narrow this gap. Current LLM-based fake news attacks require human intervention for information gathering and often miss details and fail to maintain context consistency. Therefore, to better understand threat tactics, we propose a strong fake news attack method called conditional Variational-autoencoder-Like Prompt (VLPrompt). Unlike current methods, VLPrompt eliminates the need for additional data collection while maintaining contextual coherence and preserving the intricacies of the original text. To propel future research on detecting VLPrompt attacks, we created a new dataset named VLPrompt fake news (VLPFN) containing real and fake texts. Our experiments, including various detection methods and novel human study metrics, were conducted to assess their performance on our dataset, yielding numerous findings.
Computation and Language,Social and Information Networks
What problem does this paper attempt to address?
This paper explores the ability of large language models (LLMs) to generate fake news, particularly in complex fields such as healthcare. The research indicates that there is a gap between the deception of fake news generated by LLMs without human assistance and with human involvement, and the current prompting techniques have not been fully explored. Therefore, the paper aims to determine if this gap can be effectively narrowed through prompting strategies. Current LLM fake news attack methods require manual information collection, lack detailed evidence, and cannot maintain contextual consistency. To address this, the paper proposes a strong fake news attack method called Variationally Latent Prompt (VLPrompt), which does not require additional data collection and maintains contextual coherence and preserves the details of the original text. To facilitate research on the detection of VLPrompt attacks, the paper creates a new dataset called VLPFN, which contains both real and fake text. The experiments evaluate various detection methods and novel human evaluation metrics, and the results show that VLPrompt performs better in reducing article generation costs and effectively deceiving both human and automated detectors. Additionally, the paper identifies patterns of LLM-generated fake news to help people detect such articles.