Abstract:The integration of tools in augmenting large language models presents a novel approach toward enhancing the efficiency and accuracy of these models in handling specific, complex tasks. This paper delves into the methodology,challenges, and developments in the realm of teaching LLMs to use external tools, thereby pushing the boundaries of their capabilities beyond pre-existing knowledge bases. We introduce a standardized paradigm for tool integration guided by a series of functions that map user instructions to actionable plans and their execution, emphasizing the significance of understanding user intent, tool selection, and dynamic plan adjustment. Our exploration reveals the various challenges encountered, such as tool invocation timing, selection accuracy, and the need for robust reasoning processes. In addressing these challenges, we investigate techniques within the context of fine-tuning and incontext learning paradigms, highlighting innovative approaches to ensure diversity, augment datasets, and improve <a class="link-external link-http" href="http://generalization.Furthermore" rel="external noopener nofollow">this http URL</a>, we investigate a perspective on enabling LLMs to not only utilize but also autonomously create tools, which may redefine their role from mere tool users to tool creators. Finally,we reproduced Chameleon's results on ScienceQA and analyzed the code structure.

What problem does this paper attempt to address?

The paper aims to address the challenges faced by large language models (LLMs) in specific professional domains, especially in scenarios requiring high precision and real-time performance. The core objective of the paper is to improve the accuracy and efficiency of LLMs in handling complex tasks by teaching them how to use external tools. Specifically, the paper explores the following points: 1. **Timing of Tool Invocation**: Determining when to invoke a tool is crucial. External tools should be called only when the model itself cannot directly provide an accurate answer or solution, such as in scenarios requiring access to real-time data, performing complex calculations in specific domains, or handling documents in special formats. 2. **Tool Selection and Accuracy**: Choosing the right tool is essential for the successful completion of tasks. However, as the number of available tools increases, ensuring the accuracy of each invocation becomes more challenging. 3. **Method of Tool Invocation**: Effectively using tools requires not only knowing their existence but also understanding how to invoke them correctly. This includes understanding tool interfaces, parameter types, and values. 4. **Robustness of the Reasoning Process**: Throughout the reasoning process, the accumulation of errors can amplify problems and affect the quality of results. Therefore, establishing a mechanism for detecting and correcting errors is critical. 5. **Time Efficiency**: Time efficiency is a key consideration when using LLMs. As the reasoning and tool invocation processes become more complex, the time cost of the entire process also increases accordingly. 6. **Generalization Ability**: Achieving Artificial General Intelligence (AGI) is an ultimate goal in the field of artificial intelligence. However, current technologies still face many challenges, particularly in how to invoke different tools based on different scenarios to solve complex reasoning problems. The paper proposes a standardized approach to teach LLMs how to use external tools and explores three main strategies: fine-tuning, in-context learning, and enabling the model to autonomously create tools. These methods aim to enhance the functionality of LLMs and their ability to perform specific tasks.

LLM With Tools: A Survey

Tool Learning with Large Language Models: A Survey

Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

What Are Tools Anyway? A Survey from the Language Model Perspective

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

ToolQA: A Dataset for LLM Question Answering with External Tools

Tool-Augmented LLMs as a Universal Interface for IDEs

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

Towards Tool Use Alignment of Large Language Models

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Towards Practical Tool Usage for Continually Learning LLMs

ControlLLM: Augment Language Models with Tools by Searching on Graphs

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions

Chain of Tools: Large Language Model is an Automatic Multi-tool Learner

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios