AI Agents 相关度: 9/10

PaperVoyager : Building Interactive Web with Visual Language Models

Dasen Dai, Biao Wu, Meng Fang, Wenhao Wang
arXiv: 2603.22999v1 发布: 2026-03-24 更新: 2026-03-24

AI 摘要

PaperVoyager将科研论文转化为可交互的Web系统,提升了科学论文的理解和交互方式。

主要贡献

  • 提出了Paper-to-Interactive-System Agent
  • 构建了结构化的生成框架PaperVoyager
  • 提出了包含19篇论文的评估基准

方法论

通过对论文进行理解、系统建模和交互式网页合成,实现无需人工干预的端到端处理。

原文摘要

Recent advances in visual language models have enabled autonomous agents for complex reasoning, tool use, and document understanding. However, existing document agents mainly transform papers into static artifacts such as summaries, webpages, or slides, which are insufficient for technical papers involving dynamic mechanisms and state transitions. In this work, we propose a Paper-to-Interactive-System Agent that converts research papers into executable interactive web systems. Given a PDF paper, the agent performs end-to-end processing without human intervention, including paper understanding, system modeling, and interactive webpage synthesis, enabling users to manipulate inputs and observe dynamic behaviors. To evaluate this task, we introduce a benchmark of 19 research papers paired with expert-built interactive systems as ground truth. We further propose PaperVoyager, a structured generation framework that explicitly models mechanisms and interaction logic during synthesis. Experiments show that PaperVoyager significantly improves the quality of generated interactive systems, offering a new paradigm for interactive scientific paper understanding.

标签

VLM Agent Interactive System

arXiv 分类

cs.CL