AI Agents 相关度: 9/10

Anagent For Enhancing Scientific Table & Figure Analysis

Xuehang Guo, Zhiyong Lu, Tom Hope, Qingyun Wang
arXiv: 2602.10081v1 发布: 2026-02-10 更新: 2026-02-10

AI 摘要

Anagent通过多智能体框架提升科学表格和图表分析能力,显著提高了解释准确性。

主要贡献

  • 提出了AnaBench,一个大规模科学表格和图表分析的基准数据集。
  • 构建了Anagent,一个多智能体框架,包含Planner、Expert、Solver和Critic四个模块。
  • 开发了模块化训练策略,结合监督微调和强化学习优化智能体协作。

方法论

构建多智能体框架,分解任务,检索信息,综合分析,迭代评估,并通过监督微调和强化学习优化。

原文摘要

In scientific research, analysis requires accurately interpreting complex multimodal knowledge, integrating evidence from different sources, and drawing inferences grounded in domain-specific knowledge. However, current artificial intelligence (AI) systems struggle to consistently demonstrate such capabilities. The complexity and variability of scientific tables and figures, combined with heterogeneous structures and long-context requirements, pose fundamental obstacles to scientific table \& figure analysis. To quantify these challenges, we introduce AnaBench, a large-scale benchmark featuring $63,178$ instances from nine scientific domains, systematically categorized along seven complexity dimensions. To tackle these challenges, we propose Anagent, a multi-agent framework for enhanced scientific table \& figure analysis through four specialized agents: Planner decomposes tasks into actionable subtasks, Expert retrieves task-specific information through targeted tool execution, Solver synthesizes information to generate coherent analysis, and Critic performs iterative refinement through five-dimensional quality assessment. We further develop modular training strategies that leverage supervised finetuning and specialized reinforcement learning to optimize individual capabilities while maintaining effective collaboration. Comprehensive evaluation across 170 subdomains demonstrates that Anagent achieves substantial improvements, up to $\uparrow 13.43\%$ in training-free settings and $\uparrow 42.12\%$ with finetuning, while revealing that task-oriented reasoning and context-aware problem-solving are essential for high-quality scientific table \& figure analysis. Our project page: https://xhguo7.github.io/Anagent/.

标签

多模态学习 智能体 科学分析

arXiv 分类

cs.CL cs.AI