Tool Learning with Large Language Models: A Survey

Changle Qu,Sunhao Dai,Xiaochi Wei,Hengyi Cai,Shuaiqiang Wang,Dawei Yin,Jun Xu,Ji-Rong Wen
2024-05-30
Abstract:Recently, tool learning with large language models (LLMs) has emerged as a promising paradigm for augmenting the capabilities of LLMs to tackle highly complex problems. Despite growing attention and rapid advancements in this field, the existing literature remains fragmented and lacks systematic organization, posing barriers to entry for newcomers. This gap motivates us to conduct a comprehensive survey of existing works on tool learning with LLMs. In this survey, we focus on reviewing existing literature from the two primary aspects (1) why tool learning is beneficial and (2) how tool learning is implemented, enabling a comprehensive understanding of tool learning with LLMs. We first explore the "why" by reviewing both the benefits of tool integration and the inherent benefits of the tool learning paradigm from six specific aspects. In terms of "how", we systematically review the literature according to a taxonomy of four key stages in the tool learning workflow: task planning, tool selection, tool calling, and response generation. Additionally, we provide a detailed summary of existing benchmarks and evaluation methods, categorizing them according to their relevance to different stages. Finally, we discuss current challenges and outline potential future directions, aiming to inspire both researchers and industrial developers to further explore this emerging and promising area. We also maintain a GitHub repository to continually keep track of the relevant papers and resources in this rising area at \url{<a class="link-external link-https" href="https://github.com/quchangle1/LLM-Tool-Survey" rel="external noopener nofollow">this https URL</a>}.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The paper primarily explores the integration of large language models (LLMs) with tool learning, aiming to enhance the ability of LLMs to handle complex problems. Specifically, the paper attempts to address the following key issues: 1. **Significance of Tool Learning**: The paper first elaborates on the importance of tool learning for LLMs, including: - **Knowledge Acquisition**: By integrating tools such as search engines, LLMs can access the latest information and data, thereby compensating for their inherent knowledge limitations. - **Professional Capability Enhancement**: Utilizing external resources like mathematical tools and programming environments to improve LLMs' expertise in specific domains. - **Automation and Efficiency**: Enhancing the practicality of LLMs through functionalities like task scheduling and reminder settings. - **Interaction Experience Optimization**: Using multimodal tools and translation tools to enhance LLMs' understanding and interaction experience. - **Explainability and User Trust**: Increasing the transparency of LLMs by showcasing the decision-making process, thereby boosting user trust. - **Robustness and Adaptability**: Reducing reliance on statistical patterns in training data, improving LLMs' robustness to changes in external input and their ability to adapt to new environments. 2. **Methods of Tool Learning**: The paper further details the specific implementation methods of tool learning, including four key stages: - **Task Planning**: Analyzing user queries, understanding their intent, and breaking them down into solvable sub-problems. - **Tool Selection**: Choosing the most suitable tools from the available options to solve the problem. - **Tool Invocation**: Executing the operations of the selected tools, such as querying databases or running code. - **Response Generation**: Generating the final response to the user based on the results returned by the tools. 3. **Review of Existing Research**: The paper also systematically reviews existing related work, including different tool learning methods, evaluation benchmarks, and the challenges faced. In summary, the goal of this paper is to provide a comprehensive overview of how tool learning can help LLMs overcome existing limitations, particularly in enhancing their ability to handle complex problems, and to offer a systematic reference framework for researchers and developers in this field.