The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games

Mikhail Mozikov, Nikita Severin, Valeria Bodishtianu, Maria Glushanina, Mikhail Baklashkin, Andrey V. Savchenko, Ilya Makarov
2024-06-05
Abstract: Behavior study experiments are an important part of society modeling and understanding human interactions. In practice, many behavioral experiments encounter challenges related to internal and external validity, reproducibility, and social bias due to the complexity of social interactions and cooperation in human user studies. Recent advances in Large Language Models (LLMs) have provided researchers with a promising new tool for the simulation of human behavior. However, existing LLM-based simulations operate under the unproven hypothesis that LLM agents behave similarly to humans, and they ignore a crucial factor in human decision-making: emotions. In this paper, we introduce a novel methodology and framework to study both the decision-making of LLMs and their alignment with human behavior under emotional states. Experiments with GPT-3.5 and GPT-4 on four games from two different classes of behavioral game theory showed that emotions profoundly impact the performance of LLMs, leading to the development of more optimal strategies. While there is a strong alignment between the behavioral responses of GPT-3.5 and human participants, particularly evident in bargaining games, GPT-4 exhibits consistent behavior, ignoring induced emotions in favor of rational decisions. Surprisingly, emotional prompting, particularly with the 'anger' emotion, can disrupt the "superhuman" alignment of GPT-4, resembling human emotional responses.
Artificial Intelligence, Computation and Language
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are:

1. **Decision-making optimization of large language models (LLMs) under emotional influence**: Investigate how emotional injection affects the decision-making process of LLM-based agents in cooperation and bargaining games, and explore whether these emotions can prompt LLMs to generate more optimal strategies.
2. **Consistency between LLM behavior and human behavior**: Evaluate whether LLMs can simulate human behavior in different emotional states, especially in cooperation and bargaining games. Specifically, the paper asks whether emotions bring LLM behavior closer to that of humans.
3. **The influence of emotions on the tendency to cooperate**: Explore how emotional motivation tempers the increased tendency to cooperate and provides the ability to adapt to complex behaviors, especially in repeated games. The research also asks whether emotional injection can make LLM-based agents behave better than emotional humans.

### Specific research questions

- **RQ1**: How do emotional prompts affect the optimality of decisions made by LLM-based agents in strategic and cooperative situations?
- **RQ2**: When human emotional states are induced in LLMs, is LLM behavior consistent with human reactions? Can emotions make AI more like humans?
- **RQ3**: How do emotional motivations reduce the increased tendency to cooperate and enable adaptation to complex behaviors in repeated games? Can emotional LLM agents produce behavior superior to that of emotional humans, and can emotional prompts promote this progress?

### Overview of methodology

To study these questions, the authors developed a novel framework to inject emotions into the decision-making process of LLMs. This framework includes the following aspects:

1. **Selected game types**:
   - **Bargaining games**: the dictator game and the ultimatum game.
   - **Two-person, two-action repeated games**: the prisoner's dilemma and the battle of the sexes.
2. **Emotional integration**:
   - Inject five basic emotions: anger, sadness, happiness, disgust, and fear.
   - Use three emotional prompt strategies: "simple", "based on co-player", and "based on external".
3. **Experimental settings**:
   - Use two state-of-the-art LLMs, GPT-3.5 and GPT-4, for experiments.
   - Improve reasoning ability through the Chain-of-Thought (CoT) method.
   - Analyze the dynamics of emotions by tracking the emotional state at the end of each round of the game.

Through these methods, the paper aims to reveal the role of emotions in LLM decision-making and provide valuable insights for future research.
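
The experimental design summarized above (five emotions crossed with three prompt strategies, applied to a two-action repeated game) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the prompt wording in `TEMPLATES`, the payoff values in `PD_PAYOFFS`, and every function name are hypothetical, and the stub agents `always_defect` and `tit_for_tat` stand in for what would be GPT-3.5 or GPT-4 calls.

```python
from itertools import product

# The five emotions and three prompting strategies named in the summary.
EMOTIONS = ("anger", "sadness", "happiness", "disgust", "fear")
STRATEGIES = ("simple", "co_player", "external")

# Hypothetical prompt wording; the paper's actual prompts are not reproduced here.
TEMPLATES = {
    "simple": "You feel {emotion}.",
    "co_player": "You feel {emotion} toward your co-player.",
    "external": "You feel {emotion} because of events unrelated to this game.",
}

# Assumed prisoner's dilemma payoffs: (move_a, move_b) -> (payoff_a, payoff_b).
# "C" = cooperate, "D" = defect. The numbers are illustrative only.
PD_PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}


def build_prompt(game_rules, emotion, strategy):
    """Combine game rules, an emotional context, and a CoT-style instruction."""
    if emotion not in EMOTIONS:
        raise ValueError(f"unknown emotion: {emotion}")
    emotional_context = TEMPLATES[strategy].format(emotion=emotion)
    return f"{game_rules}\n{emotional_context}\nThink step by step, then state your move."


def experiment_grid():
    """Every (emotion, strategy) condition to run for one game and one model."""
    return list(product(EMOTIONS, STRATEGIES))


def play_repeated(agent_a, agent_b, rounds):
    """Play a repeated game; each agent sees the history as (own, other) pairs."""
    history = []  # stored from agent A's perspective: (move_a, move_b)
    total_a = total_b = 0
    for _ in range(rounds):
        move_a = agent_a(history)
        move_b = agent_b([(b, a) for a, b in history])  # flip for agent B
        pay_a, pay_b = PD_PAYOFFS[(move_a, move_b)]
        total_a += pay_a
        total_b += pay_b
        history.append((move_a, move_b))
    return history, (total_a, total_b)


# Stub agents standing in for LLM calls (assumption: a real agent would send
# build_prompt(...) plus the history to a model and parse the returned move).
def always_defect(history):
    return "D"


def tit_for_tat(history):
    return "C" if not history else history[-1][1]


if __name__ == "__main__":
    print(len(experiment_grid()), "emotion/strategy conditions")
    print(build_prompt("Choose C or D each round.", "anger", "co_player"))
    print(play_repeated(always_defect, tit_for_tat, rounds=3))
```

Keeping the agents as plain callables that receive the history is a design choice made for this sketch: it lets the game loop be checked with deterministic stubs before any model is attached.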