The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
AI 摘要
该论文综述了LLM Agent工具使用从单工具调用到多工具编排的演变,并分析了当前的研究进展。
主要贡献
- 统一了任务形式,区分了单次调用和长程编排。
- 围绕六个核心维度组织文献,全面分析了多工具LLM Agent。
- 总结了在软件工程、企业工作流等领域的代表性应用。
方法论
论文通过系统性的文献回顾,分析了多工具LLM Agent的研究现状,并提出了未来研究方向。
原文摘要
Tool use enables large language models (LLMs) to access external information, invoke software systems, and act in digital environments beyond what can be solved from model parameters alone. Early research mainly studied whether a model could select and execute a correct single tool call. As agent systems evolve, however, the central problem has shifted from isolated invocation to multi-tool orchestration over long trajectories with intermediate state, execution feedback, changing environments, and practical constraints such as safety, cost, and verifiability. We comprehensively review recent progress in multi-tool LLM agents and analyzes the state of the art in this rapidly developing area. First, we unify task formulations and distinguish single-call tool use from long-horizon orchestration. Then, we organize the literature around six core dimensions: inference-time planning and execution, training and trajectory construction, safety and control, efficiency under resource constraints, capability completeness in open environments, and benchmark design and evaluation. We further summarize representative applications in software engineering, enterprise workflows, graphical user interfaces, and mobile systems. Finally, we discuss major challenges and outline future directions for building reliable, scalable, and verifiable multi-tool agents.