Abstract:Behavior study experiments are an important part of society modeling and understanding human interactions. In practice, many behavioral experiments encounter challenges related to internal and external validity, reproducibility, and social bias due to the complexity of social interactions and cooperation in human user studies. Recent advances in Large Language Models (LLMs) have provided researchers with a new promising tool for the simulation of human behavior. However, existing LLM-based simulations operate under the unproven hypothesis that LLM agents behave similarly to humans as well as ignore a crucial factor in human decision-making: emotions. In this paper, we introduce a novel methodology and the framework to study both, the decision-making of LLMs and their alignment with human behavior under emotional states. Experiments with GPT-3.5 and GPT-4 on four games from two different classes of behavioral game theory showed that emotions profoundly impact the performance of LLMs, leading to the development of more optimal strategies. While there is a strong alignment between the behavioral responses of GPT-3.5 and human participants, particularly evident in bargaining games, GPT-4 exhibits consistent behavior, ignoring induced emotions for rationality decisions. Surprisingly, emotional prompting, particularly with `anger' emotion, can disrupt the "superhuman" alignment of GPT-4, resembling human emotional responses.

What problem does this paper attempt to address?

The main problems that this paper attempts to solve are: 1. **Decision - making optimization of large - language models (LLM) under emotional influence**: Research how emotional injection affects the decision - making process of LLM - based agents in cooperation and bargaining games, and explore whether these emotions can prompt LLM to generate more optimal strategies. 2. **Consistency between LLM behavior and human behavior**: Evaluate whether the behavior of LLM can simulate human behavior in different emotional states, especially in cooperation and bargaining games. Specifically, the paper focuses on whether emotions will make the behavior of LLM closer to that of humans. 3. **The influence of emotions on the tendency to cooperate**: Explore how emotional motivation alleviates the increased tendency to cooperate and provides the ability to adapt to complex behaviors, especially in repeated games. The research also attempts to understand whether emotional injection can make LLM - based agents exhibit better behavior than emotional humans. ### Specific research questions - **RQ1**: How do emotional prompts affect the optimality of decisions made by LLM - based agents in strategic and cooperative situations? - **RQ2**: When inducing human emotional states in LLM, is there consistency between LLM behavior and human reactions? Can emotions make AI more like humans? - **RQ3**: How do emotional motivations reduce the increased tendency to cooperate and adapt to complex behaviors in repeated games? Can emotional LLM agents produce behavior superior to that of emotional humans, and can emotional prompts promote this progress? ### Overview of methodology To study these questions, the author developed a novel framework to inject emotions into the decision - making process of LLM. This framework includes the following aspects: 1. **Selected game types**: - **Bargaining games**: Such as the dictator game and the ultimatum game. - **Two - person, two - action repeated games**: Such as the prisoner's dilemma and the battle of the sexes. 2. **Emotional integration**: - Inject five basic emotions: anger, sadness, happiness, disgust, and fear. - Use three emotional prompt strategies: "simple", "based on co - player", and "based on external". 3. **Experimental settings**: - Use two state - of - the - art LLMs, GPT - 3.5 and GPT - 4, for experiments. - Improve reasoning ability through the Chain - of - Thought (CoT) method. - Analyze the dynamic changes of emotions and track the emotional states at the end of each round of the game. Through these methods, the paper aims to reveal the role of emotions in LLM decision - making and provide valuable insights for future research.

The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games

Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games

Playing Games With GPT: What Can We Learn About a Large Language Model From Canonical Strategic Games?

Can Large Language Models Serve as Rational Players in Game Theory? A Systematic Analysis

GPT-3.5 altruistic advice is sensitive to reciprocal concerns but not to strategic risk

GPT-4 Emulates Average-Human Emotional Cognition from a Third-Person Perspective

Strategic Behavior of Large Language Models: Game Structure vs. Contextual Framing

What's Next in Affective Modeling? Large Language Models

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

Is GPT a Computational Model of Emotion? Detailed Analysis

Rethinking Emotion Annotations in the Era of Large Language Models

Using cognitive psychology to understand GPT-3

The Machine Psychology of Cooperation: Can GPT models operationalise prompts for altruism, cooperation, competitiveness and selfishness in economic games?

Assessing Large Language Models' ability to predict how humans balance self-interest and the interest of others

The Emergence of Economic Rationality of GPT

Can large language models help predict results from a complex behavioural science study?

Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay

Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?

Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis

Real-time emotion generation in human-robot dialogue using large language models

Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?