AI Agents 相关度: 9/10

An Empirical Study of Multi-Agent Collaboration for Automated Research

Yang Shen, Zhenyi Yi, Ziyi Zhao, Lijun Sun, Dongyang Li, Chin-Teng Lin, Yuhui Shi
arXiv: 2603.29632v1 发布: 2026-03-31 更新: 2026-03-31

AI 摘要

论文对比研究了不同多智能体协作架构在自动化机器学习优化中的性能,揭示了稳定性和理论深度之间的权衡。

主要贡献

  • 提出了一个严格控制的、基于执行的测试平台,用于评估多智能体系统。
  • 对比了子代理架构和代理团队架构在自动化机器学习优化中的性能。
  • 揭示了操作稳定性和理论审议之间的权衡,并提出了针对未来自动研究系统的设计建议。

方法论

通过严格控制的实验,对比单智能体、子代理和代理团队三种架构在固定计算资源下的自动化机器学习优化表现。

原文摘要

As AI agents evolve, the community is rapidly shifting from single Large Language Models (LLMs) to Multi-Agent Systems (MAS) to overcome cognitive bottlenecks in automated research. However, the optimal multi-agent coordination framework for these autonomous agents remains largely unexplored. In this paper, we present a systematic empirical study investigating the comparative efficacy of distinct multi-agent structures for automated machine learning optimization. Utilizing a rigorously controlled, execution-based testbed equipped with Git worktree isolation and explicit global memory, we benchmark a single-agent baseline against two multi-agent paradigms: a subagent architecture (parallel exploration with post-hoc consolidation) and an agent team architecture (experts with pre-execution handoffs). By evaluating these systems under strictly fixed computational time budgets, our findings reveal a fundamental trade-off between operational stability and theoretical deliberation. The subagent mode functions as a highly resilient, high-throughput search engine optimal for broad, shallow optimizations under strict time constraints. Conversely, the agent team topology exhibits higher operational fragility due to multi-author code generation but achieves the deep theoretical alignment necessary for complex architectural refactoring given extended compute budgets. These empirical insights provide actionable guidelines for designing future autoresearch systems, advocating for dynamically routed architectures that adapt their collaborative structures to real-time task complexity.

标签

多智能体系统 自动化机器学习 架构搜索 AI Agent

arXiv 分类

cs.MA cs.AI