DeLLMa: Decision Making Under Uncertainty with Large Language Models

Ollie Liu,Deqing Fu,Dani Yogatama,Willie Neiswanger
2024-10-12
Abstract:The potential of large language models (LLMs) as decision support tools is increasingly being explored in fields such as business, engineering, and medicine, which often face challenging tasks of decision-making under uncertainty. In this paper, we show that directly prompting LLMs on these types of decision-making problems can yield poor results, especially as the problem complexity increases. To aid in these tasks, we propose DeLLMa (Decision-making Large Language Model assistant), a framework designed to enhance decision-making accuracy in uncertain environments. DeLLMa involves a multi-step reasoning procedure that integrates recent best practices in scaling inference-time reasoning, drawing upon principles from decision theory and utility theory, to provide an accurate and human-auditable decision-making process. We validate our procedure on multiple realistic decision-making environments, demonstrating that DeLLMa can consistently enhance the decision-making performance of leading language models, and achieve up to a 40% increase in accuracy over competing methods. Additionally, we show how performance improves when scaling compute at test time, and carry out human evaluations to benchmark components of DeLLMa.
Artificial Intelligence,Computation and Language,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of how to effectively use Large Language Models (LLMs) for decision support under uncertain conditions. Specifically, the authors point out that directly using LLMs to handle complex decision problems often yields poor results, especially as the complexity of the problem increases. Therefore, they propose a framework called DeLLMa (Decision-making Large Language Model assistant), which aims to enhance the decision accuracy of LLMs in uncertain environments through a multi-step reasoning process, and to ensure that the decision-making process is interpretable and auditable by humans. ### Main Issues 1. **Decision Accuracy**: When directly using LLMs for decision support, especially in complex and highly uncertain situations, the decision accuracy is poor. 2. **Interpretability and Auditability**: Existing LLMs lack detailed explanations of the decision-making process, which limits their credibility and reliability in practical applications. ### Solution To address the above issues, the DeLLMa framework is designed with the following key steps: 1. **State Enumeration**: Identify and predict unknown variables related to the decision objective. 2. **State Prediction**: Generate probability distributions for these unknown variables based on contextual information. 3. **Utility Function Extraction**: Construct a utility function based on the user's goals, which maps each state-action pair to a scalar value. 4. **Expected Utility Maximization**: Calculate the expected utility of each action and select the action with the highest expected utility as the final decision. ### Experimental Validation The authors validated the effectiveness of DeLLMa in real decision-making environments in the fields of agricultural planning and financial investment. Experimental results show that DeLLMa can significantly improve decision accuracy, with an increase of up to 40%, and performs consistently across multiple leading LLMs. ### Contributions 1. **Proposing the DeLLMa Framework**: A multi-step reasoning method based on classical decision theory to enhance the decision-making capabilities of LLMs under uncertain conditions. 2. **Evaluation Components**: Including the calibration of state prediction methods and the study of human consistency in utility function extraction methods. 3. **Performance Improvement**: In real decision-making environments, DeLLMa significantly improved decision accuracy compared to other methods and showed consistent performance improvement across multiple LLMs. Through these contributions, DeLLMa not only improves the decision accuracy of LLMs under uncertain conditions but also enhances the transparency and interpretability of the decision-making process, making it more credible in practical applications.