Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi
2024-05-30
Abstract: In the face of uncertainty, the ability to *seek information* is of fundamental importance. In many practical applications, such as medical diagnosis and troubleshooting, the information needed to solve the task is not initially given and has to be actively sought by asking follow-up questions (for example, a doctor asking a patient for more details about their symptoms). In this work, we introduce Uncertainty of Thoughts (UoT), an algorithm to augment large language models with the ability to actively seek information by asking effective questions. UoT combines 1) an *uncertainty-aware simulation approach* which enables the model to simulate possible future scenarios and how likely they are to occur, 2) *uncertainty-based rewards* motivated by information gain which incentivize the model to seek information, and 3) a *reward propagation scheme* to select the optimal question to ask in a way that maximizes the expected reward. In experiments on medical diagnosis, troubleshooting, and the "20 Questions" game, UoT achieves an average performance improvement of 38.1% in the rate of successful task completion across multiple LLMs compared with direct prompting and also improves efficiency (i.e., the number of questions needed to complete the task). Our code has been released [here](https://github.com/zhiyuanhubj/UoT).
Computation and Language, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
This paper addresses how large language models (LLMs) can effectively seek information when faced with uncertainty. Specifically, it focuses on scenarios where the information needed to solve a problem is not provided upfront but must be actively obtained through questioning, as is common in practical applications such as medical diagnosis and troubleshooting. To this end, the paper proposes an algorithm called "Uncertainty of Thoughts" (UoT) that enhances the ability of LLMs to actively seek information through effective questioning.

### Main Issues:

1. **How can large language models actively seek information when faced with uncertainty?**
   - In many practical application scenarios, such as medical diagnosis and troubleshooting, the information needed to solve a problem is not fully provided at the outset but must be gradually obtained through questioning.
2. **How can an algorithm be designed so that large language models generate effective questions, maximizing the efficiency and accuracy of information acquisition?**
   - The paper proposes UoT, which combines an uncertainty-aware simulation approach, an information gain-based reward mechanism, and a reward propagation scheme to select the optimal question.

### Specific Goals:

- **Increase the success rate of task completion**: With UoT, the rate of successful task completion improves by an average of 38.1% across multiple LLMs compared with direct prompting.
- **Improve efficiency**: Reduce the number of questions needed to complete a task, thereby increasing the efficiency of information acquisition.

### Application Scenarios:

- **Medical Diagnosis**: Doctors ask questions to obtain detailed symptoms from patients to make accurate diagnoses.
- **Troubleshooting**: Technicians ask questions to gradually narrow down the scope of a fault and ultimately find the root cause of the problem.
- **20 Questions Game**: Guessing an unknown object or entity through a series of "yes/no" questions.

### Method Overview:

1. **Question Generation and Simulation**: UoT first generates multiple candidate questions and simulates the potential future scenarios each question might trigger.
2. **Uncertainty-Based Reward Calculation**: Evaluate the effectiveness of each question through information gain, calculating the extent to which it reduces the model's uncertainty.
3. **Reward Propagation Scheme**: Select the question with the highest expected reward by recursively calculating cumulative and expected rewards.

### Contributions:

1. **Introduction of the UoT Algorithm**: A plug-in method that enables large language models to explicitly model and reduce uncertainty.
2. **Proposal of a New Benchmark Dataset**: Includes 3 tasks and 5 datasets to evaluate the ability of large language models to actively seek information.
3. **Experimental Validation**: Experiments conducted on multiple large language models show significant improvements in task success rates and efficiency with UoT.

Through these methods and contributions, the paper provides a new solution for the effective operation of large language models in environments with high uncertainty and ambiguity.
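The three steps in the method overview can be illustrated with a toy 20 Questions example. This is a minimal sketch under stated assumptions, not the released UoT implementation: candidates are assumed equally likely, every question is a fixed yes/no predicate rather than an LLM-generated question, the reward is the plain entropy of the yes/no split, and the names `information_gain`, `expected_reward`, and `select_question` are hypothetical. Any reward scaling or simulation details used in the paper are not modeled here.

```python
import math
from typing import Callable, Dict, FrozenSet

# Hypothetical representation: a question maps a candidate to True ("yes")
# or False ("no"). In UoT the questions would come from an LLM instead.
Question = Callable[[str], bool]


def information_gain(candidates: FrozenSet[str], question: Question) -> float:
    """Expected entropy reduction (bits) of a yes/no question, uniform prior."""
    n = len(candidates)
    n_yes = sum(1 for c in candidates if question(c))
    gain = 0.0
    for count in (n_yes, n - n_yes):
        if 0 < count < n:  # a one-sided split carries no information
            p = count / n
            gain -= p * math.log2(p)
    return gain


def expected_reward(
    candidates: FrozenSet[str],
    question: Question,
    questions: Dict[str, Question],
    depth: int,
) -> float:
    """Immediate gain plus the probability-weighted best follow-up reward."""
    reward = information_gain(candidates, question)
    if depth <= 1 or reward == 0.0:
        return reward
    yes = frozenset(c for c in candidates if question(c))
    no = candidates - yes
    for subset in (yes, no):
        if len(subset) > 1:  # a single remaining candidate needs no follow-up
            best_next = max(
                expected_reward(subset, q, questions, depth - 1)
                for q in questions.values()
            )
            reward += len(subset) / len(candidates) * best_next
    return reward


def select_question(
    candidates: FrozenSet[str], questions: Dict[str, Question], depth: int = 2
) -> str:
    """Return the name of the question with the highest expected reward."""
    return max(
        questions,
        key=lambda name: expected_reward(candidates, questions[name], questions, depth),
    )


# Toy data, invented purely for illustration.
candidates = frozenset({"cat", "dog", "eagle", "shark"})
questions: Dict[str, Question] = {
    "Is it an animal?": lambda c: True,
    "Is it a mammal?": lambda c: c in {"cat", "dog"},
    "Can it fly?": lambda c: c == "eagle",
    "Does it live in water?": lambda c: c == "shark",
    "Does it bark?": lambda c: c == "dog",
}

best = select_question(candidates, questions, depth=2)
print(best)  # Is it a mammal?
```

In this toy setting, "Is it an animal?" earns zero reward because every candidate answers yes, while "Is it a mammal?" splits the candidates evenly and leaves each half separable by one further question, so it receives the highest expected reward with a two-step lookahead.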