AI Agents 相关度: 9/10

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

Razvan Mihai Popescu, David Gros, Andrei Botocan, Rahul Pandita, Prem Devanbu, Maliheh Izadi
arXiv: 2604.00917v1 发布: 2026-04-01 更新: 2026-04-01

AI 摘要

研究AI编码代理在开源项目中的活动模式和代码随时间的变化,发现其贡献与更高的代码变动率相关。

主要贡献

  • 构建了一个包含11万开源PR的数据集
  • 比较了五种流行的编码代理的使用差异
  • 提供了代理生成代码与人类编写代码的生存率和变动率的纵向估计

方法论

构建数据集,分析PR、提交、评论等,比较不同代理的merge频率、文件类型和开发者交互信号,并进行生存分析。

原文摘要

The rise of large language models for code has reshaped software development. Autonomous coding agents, able to create branches, open pull requests, and perform code reviews, now actively contribute to real-world projects. Their growing role offers a unique and timely opportunity to investigate AI-driven contributions and their effects on code quality, team dynamics, and software maintainability. In this work, we construct a novel dataset of approximately $110,000$ open-source pull requests, including associated commits, comments, reviews, issues, and file changes, collectively representing millions of lines of source code. We compare five popular coding agents, including OpenAI Codex, Claude Code, GitHub Copilot, Google Jules, and Devin, examining how their usage differs in various development aspects such as merge frequency, edited file types, and developer interaction signals, including comments and reviews. Furthermore, we emphasize that code authoring and review are only a small part of the larger software engineering process, as the resulting code must also be maintained and updated over time. Hence, we offer several longitudinal estimates of survival and churn rates for agent-generated versus human-authored code. Ultimately, our findings indicate an increasing agent activity in open-source projects, although their contributions are associated with more churn over time compared to human-authored code.

标签

AI Agents Software Engineering Code Generation Open Source Code Churn

arXiv 分类

cs.SE cs.AI cs.LG