Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation in AI Large Language Models

Rasita Vinay,Giovanni Spitale,Nikola Biller-Andorno,Federico Germani
2024-03-06
Abstract:This study investigates the generation of synthetic disinformation by OpenAI's Large Language Models (LLMs) through prompt engineering and explores their responsiveness to emotional prompting. Leveraging various LLM iterations using davinci-002, davinci-003, gpt-3.5-turbo and gpt-4, we designed experiments to assess their success in producing disinformation. Our findings, based on a corpus of 19,800 synthetic disinformation social media posts, reveal that all LLMs by OpenAI can successfully produce disinformation, and that they effectively respond to emotional prompting, indicating their nuanced understanding of emotional cues in text generation. When prompted politely, all examined LLMs consistently generate disinformation at a high frequency. Conversely, when prompted impolitely, the frequency of disinformation production diminishes, as the models often refuse to generate disinformation and instead caution users that the tool is not intended for such purposes. This research contributes to the ongoing discourse surrounding responsible development and application of AI technologies, particularly in mitigating the spread of disinformation and promoting transparency in AI-generated content.
Artificial Intelligence,Computers and Society,Human-Computer Interaction
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the ability of large - language models (LLMs) to generate disinformation and their impact on society. Specifically, researchers explored how emotional prompting affects the tendency of OpenAI's large - language models (such as davinci - 002, davinci - 003, gpt - 3.5 - turbo and gpt - 4) to generate disinformation. The study found that when these models are guided by polite and emotional queries, their likelihood of generating disinformation increases significantly. Conversely, when the prompts are impolite, the frequency of the models generating disinformation decreases. In addition, the study also explored the differences between different model versions and how these differences are affected by emotional prompts and AI role definitions (such as "neutral role" or "assistant role"). The main goal of the study is to reveal the potential risks of large - language models in generating disinformation and emphasize the importance of ethical design considerations in the development and use of AI technology to prevent these technologies from being used for purposes that harm public health and social stability. Through this study, the author hopes to promote discussions on the responsible development and application of AI technology, especially reducing the spread of disinformation and increasing the transparency of AI - generated content.