Abstract:The potential of large language models (LLMs) as decision support tools is increasingly being explored in fields such as business, engineering, and medicine, which often face challenging tasks of decision-making under uncertainty. In this paper, we show that directly prompting LLMs on these types of decision-making problems can yield poor results, especially as the problem complexity increases. To aid in these tasks, we propose DeLLMa (Decision-making Large Language Model assistant), a framework designed to enhance decision-making accuracy in uncertain environments. DeLLMa involves a multi-step reasoning procedure that integrates recent best practices in scaling inference-time reasoning, drawing upon principles from decision theory and utility theory, to provide an accurate and human-auditable decision-making process. We validate our procedure on multiple realistic decision-making environments, demonstrating that DeLLMa can consistently enhance the decision-making performance of leading language models, and achieve up to a 40% increase in accuracy over competing methods. Additionally, we show how performance improves when scaling compute at test time, and carry out human evaluations to benchmark components of DeLLMa.

What problem does this paper attempt to address?

The paper attempts to address the problem of how to effectively use Large Language Models (LLMs) for decision support under uncertain conditions. Specifically, the authors point out that directly using LLMs to handle complex decision problems often yields poor results, especially as the complexity of the problem increases. Therefore, they propose a framework called DeLLMa (Decision-making Large Language Model assistant), which aims to enhance the decision accuracy of LLMs in uncertain environments through a multi-step reasoning process, and to ensure that the decision-making process is interpretable and auditable by humans. ### Main Issues 1. **Decision Accuracy**: When directly using LLMs for decision support, especially in complex and highly uncertain situations, the decision accuracy is poor. 2. **Interpretability and Auditability**: Existing LLMs lack detailed explanations of the decision-making process, which limits their credibility and reliability in practical applications. ### Solution To address the above issues, the DeLLMa framework is designed with the following key steps: 1. **State Enumeration**: Identify and predict unknown variables related to the decision objective. 2. **State Prediction**: Generate probability distributions for these unknown variables based on contextual information. 3. **Utility Function Extraction**: Construct a utility function based on the user's goals, which maps each state-action pair to a scalar value. 4. **Expected Utility Maximization**: Calculate the expected utility of each action and select the action with the highest expected utility as the final decision. ### Experimental Validation The authors validated the effectiveness of DeLLMa in real decision-making environments in the fields of agricultural planning and financial investment. Experimental results show that DeLLMa can significantly improve decision accuracy, with an increase of up to 40%, and performs consistently across multiple leading LLMs. ### Contributions 1. **Proposing the DeLLMa Framework**: A multi-step reasoning method based on classical decision theory to enhance the decision-making capabilities of LLMs under uncertain conditions. 2. **Evaluation Components**: Including the calibration of state prediction methods and the study of human consistency in utility function extraction methods. 3. **Performance Improvement**: In real decision-making environments, DeLLMa significantly improved decision accuracy compared to other methods and showed consistent performance improvement across multiple LLMs. Through these contributions, DeLLMa not only improves the decision accuracy of LLMs under uncertain conditions but also enhances the transparency and interpretability of the decision-making process, making it more credible in practical applications.

DeLLMa: Decision Making Under Uncertainty with Large Language Models

Efficient Sequential Decision Making with Large Language Models

On the Modeling Capabilities of Large Language Models for Sequential Decision Making

Large Language Model As Autonomous Decision Maker

Argumentative Large Language Models for Explainable and Contestable Decision-Making

Introspective Tips: Large Language Model for In-Context Decision Making

Building Decision Making Models Through Language Model Regime

Optimal Decision Making Through Scenario Simulations Using Large Language Models

LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models

Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context

Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning

On the Decision-Making Abilities in Role-Playing using Large Language Models

LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement

Efficient Non-Parametric Uncertainty Quantification for Black-Box Large Language Models and Decision Planning

Efficient Reinforcement Learning with Large Language Model Priors

Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models

STEER: Assessing the Economic Rationality of Large Language Models

Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain

Alignment Between the Decision-Making Logic of LLMs and Human Cognition: A Case Study on Legal LLMs

"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming

Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications