AI Agents 相关度: 9/10

STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

ELita Lobo, Xu Chen, Jingjing Meng, Nan Xi, Yang Jiao, Chirag Agarwal, Yair Zick, Yan Gao
arXiv: 2603.05294v1 发布: 2026-03-05 更新: 2026-03-05

AI 摘要

STRUCTUREDAGENT通过分层规划和结构化记忆,提升LLM在长程网页任务中的表现。

主要贡献

  • 提出了一种使用动态AND/OR树的在线分层规划框架
  • 设计了一个结构化记忆模块来跟踪和维护候选解决方案
  • 实验表明,该方法在长程网页浏览任务中优于其他LLM Agent

方法论

采用在线分层规划,利用AND/OR树进行高效搜索,并使用结构化记忆模块进行信息跟踪和约束满足。

原文摘要

Recent advances in large language models (LLMs) have enabled agentic systems for sequential decision-making. Such agents must perceive their environment, reason across multiple time steps, and take actions that optimize long-term objectives. However, existing web agents struggle on complex, long-horizon tasks due to limited in-context memory for tracking history, weak planning abilities, and greedy behaviors that lead to premature termination. To address these challenges, we propose STRUCTUREDAGENT, a hierarchical planning framework with two core components: (1) an online hierarchical planner that uses dynamic AND/OR trees for efficient search and (2) a structured memory module that tracks and maintains candidate solutions to improve constraint satisfaction in information-seeking tasks. The framework also produces interpretable hierarchical plans, enabling easier debugging and facilitating human intervention when needed. Our results on WebVoyager, WebArena, and custom shopping benchmarks show that STRUCTUREDAGENT improves performance on long-horizon web-browsing tasks compared to standard LLM-based agents.

标签

LLM Agent 分层规划 结构化记忆 网页浏览

arXiv 分类

cs.AI