Large Language Models as Tool Makers
Tianle Cai,Xuezhi Wang,Tengyu Ma,Xinyun Chen,Denny Zhou
2023-05-27
Abstract:Recent research has highlighted the potential of large language models (LLMs)
to improve their problem-solving capabilities with the aid of suitable external
tools. In our work, we further advance this concept by introducing a
closed-loop framework, referred to as LLMs A s Tool Makers (LATM), where LLMs
create their own reusable tools for problem-solving. Our approach consists of
two phases: 1) tool making: an LLM acts as the tool maker that crafts tools for
a set of tasks. 2) tool using: another LLM acts as the tool user, which applies
the tool built by the tool maker for problem-solving. On the problem-solving
server side, tool-making enables continual tool generation and caching as new
requests emerge. This framework enables subsequent requests to access cached
tools via their corresponding APIs, enhancing the efficiency of task
resolution. Recognizing that tool-making requires more sophisticated
capabilities, we assign this task to a powerful, albeit resource-intensive,
model. Conversely, the simpler tool-using phase is delegated to a lightweight
model. This strategic division of labor allows the once-off cost of tool-making
to be spread over multiple instances of tool-using, significantly reducing
average costs while maintaining strong performance. Furthermore, our method
offers a functional cache through the caching and reuse of tools, which stores
the functionality of a class of requests instead of the natural language
responses from LLMs, thus extending the applicability of the conventional cache
mechanism. We evaluate our approach across various complex reasoning tasks,
including Big-Bench tasks. With GPT-4 as the tool maker and GPT-3.5 as the tool
user, LATM demonstrates performance equivalent to using GPT-4 for both roles,
but with a significantly reduced inference cost.
Machine Learning,Computation and Language,Artificial Intelligence