TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents
AI 摘要
TAPE通过工具引导自适应规划和约束执行,提升LM Agent在复杂环境下的表现。
主要贡献
- 提出TAPE框架,增强LM Agent的规划和执行能力
- 使用图结构聚合多个计划并利用外部求解器寻找可行路径
- 采用约束解码和自适应重规划策略
方法论
TAPE使用图规划、外部求解器、约束解码和自适应重规划等技术,解决LM Agent在复杂环境下的错误和约束问题。
原文摘要
Language Model (LM) agents have demonstrated remarkable capabilities in solving tasks that require multiple interactions with the environment. However, they remain vulnerable in environments where a single error often leads to irrecoverable failure, particularly under strict feasibility constraints. We systematically analyze existing agent frameworks, identifying imperfect planning and stochastic execution as the primary causes. To address these challenges, we propose Tool-guided Adaptive Planning with constrained Execution (TAPE). TAPE enhances planning capability by aggregating multiple plans into a graph and employing an external solver to identify a feasible path. During execution, TAPE employs constrained decoding to reduce sampling noise, while adaptively re-planning whenever environmental feedback deviates from the intended state. Experiments across Sokoban, ALFWorld, MuSiQue, and GSM8K-Hard demonstrate that TAPE consistently outperforms existing frameworks, with particularly large gains on hard settings, improving success rates by 21.0 percentage points on hard settings on average, and by 20.0 percentage points for weaker base models on average. Code and data available at here.