Meta-Reasoning Improves Tool Use in Large Language Models

Lisa Alazraki,Marek Rei

2024-11-07

Abstract:External tools help large language models (LLMs) succeed at tasks where they would otherwise typically fail. In existing frameworks, LLMs learn tool use either by in-context demonstrations or via full model fine-tuning on annotated data. As these approaches do not easily scale, a recent trend is to abandon them in favor of lightweight, parameter-efficient tuning paradigms. These methods allow quickly alternating between the frozen LLM and its specialised fine-tuned version, by switching on or off a handful of additional custom parameters. Hence, we postulate that the generalization ability of the frozen model can be leveraged to improve tool selection. We present Tool selECTion via meta-reasONing (TECTON), a two-phase system that first reasons over a task using a custom fine-tuned LM head and outputs candidate tools. Then, with the custom head disabled, it meta-reasons (i.e., it reasons over the previous reasoning process) to make a final choice. We show that TECTON results in substantial gains - both in-distribution and out-of-distribution - on a range of math reasoning datasets.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to effectively utilize external tools in large - scale language models (LLMs) to improve their performance in knowledge - intensive tasks, especially in mathematical reasoning tasks. Existing methods either learn tool use through context examples or perform full - model fine - tuning on labeled data. However, these methods all have scalability problems and it is difficult to handle a large number of tools or adapt to new tool sets. To solve these problems, the authors propose a new two - stage framework - Tool selECTion via meta - reas ONing (TECTON), aiming to improve tool selection through meta - reasoning, thereby improving the performance of LLMs in mathematical reasoning tasks. Specifically, the main contributions of the paper are as follows: 1. **Introduced TECTON**: A two - stage framework that combines a custom - fine - tuned head and a frozen LLM to improve tool use in mathematical reasoning tasks. 2. **Demonstrated the superior performance of TECTON**: TECTON outperforms strong baseline models both on in - distribution data and unseen benchmark tests. 3. **Enhanced three popular mathematical reasoning data sets**: Made them more challenging and shared the data and code. Through these contributions, the paper aims to show how to utilize the generalization ability of LLMs to improve tool selection through meta - reasoning, so as to achieve better performance in complex tasks.

Meta-Reasoning Improves Tool Use in Large Language Models

Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models

Multi-tool Integration Application for Math Reasoning Using Large Language Model

MetaTool: Facilitating Large Language Models to Master Tools with Meta-task Augmentation

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

Meta Reasoning for Large Language Models

Large Language Models as Tool Makers

Rational Metareasoning for Large Language Models

On the Impact of Fine-Tuning on Chain-of-Thought Reasoning

MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

ART: Automatic multi-step reasoning and tool-use for large language models

MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models

Efficient Tool Use with Chain-of-Abstraction Reasoning

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Making Large Language Models Better Reasoners with Alignment

Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning

Distilling Reasoning Ability from Large Language Models with Adaptive Thinking