AI Agents 相关度: 9/10

FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation

Zimu Lu, Houxing Ren, Yunqiao Yang, Ke Wang, Zhuofan Zong, Mingjie Zhan, Hongsheng Li
arXiv: 2602.03798v1 发布: 2026-02-03 更新: 2026-02-03

AI 摘要

FullStack-Agent通过多智能体框架、回译学习和综合测试,提升全栈Web应用开发的性能。

主要贡献

  • 提出FullStack-Agent系统,包含开发、学习和测试三个模块
  • 设计FullStack-Dev多智能体框架,具备规划、编辑、导航和调试能力
  • 提出FullStack-Learn自提升方法,通过回译提升LLM性能

方法论

构建多智能体系统,结合回译技术进行数据增强,并利用全面基准测试评估模型在全栈各方面的表现。

原文摘要

Assisting non-expert users to develop complex interactive websites has become a popular task for LLM-powered code agents. However, existing code agents tend to only generate frontend web pages, masking the lack of real full-stack data processing and storage with fancy visual effects. Notably, constructing production-level full-stack web applications is far more challenging than only generating frontend web pages, demanding careful control of data flow, comprehensive understanding of constantly updating packages and dependencies, and accurate localization of obscure bugs in the codebase. To address these difficulties, we introduce FullStack-Agent, a unified agent system for full-stack agentic coding that consists of three parts: (1) FullStack-Dev, a multi-agent framework with strong planning, code editing, codebase navigation, and bug localization abilities. (2) FullStack-Learn, an innovative data-scaling and self-improving method that back-translates crawled and synthesized website repositories to improve the backbone LLM of FullStack-Dev. (3) FullStack-Bench, a comprehensive benchmark that systematically tests the frontend, backend and database functionalities of the generated website. Our FullStack-Dev outperforms the previous state-of-the-art method by 8.7%, 38.2%, and 15.9% on the frontend, backend, and database test cases respectively. Additionally, FullStack-Learn raises the performance of a 30B model by 9.7%, 9.5%, and 2.8% on the three sets of test cases through self-improvement, demonstrating the effectiveness of our approach. The code is released at https://github.com/mnluzimu/FullStack-Agent.

标签

AI Agent Full-Stack Development Code Generation Back-Translation Multi-Agent System

arXiv 分类

cs.SE cs.CL cs.CV