Multimodal Learning 相关度: 8/10

FailureMem: A Failure-Aware Multimodal Framework for Autonomous Software Repair

Ruize Ma, Yilei Jiang, Shilin Zhang, Zheng Ma, Yi Feng, Vincent Ng, Zhi Wang, Xiangyu Yue, Chuanyi Li, Lewei Lu
arXiv: 2603.17826v1 发布: 2026-03-18 更新: 2026-03-18

AI 摘要

FailureMem是一个多模态自动软件修复框架,通过记忆失败经验提升修复成功率。

主要贡献

  • 提出混合工作流-Agent架构,平衡结构化定位与灵活推理
  • 引入主动感知工具,实现区域级视觉定位
  • 建立失败记忆库,将历史修复尝试转化为可复用指导

方法论

FailureMem结合混合架构、主动视觉感知和失败记忆库,提升LLM在多模态软件修复中的性能。

原文摘要

Multimodal Automated Program Repair (MAPR) extends traditional program repair by requiring models to jointly reason over source code, textual issue descriptions, and visual artifacts such as GUI screenshots. While recent LLM-based repair systems have shown promising results, existing approaches face several limitations: rigid workflow pipelines restrict exploration during debugging, visual reasoning is often performed over full-page screenshots without localized grounding, and failed repair attempts are rarely transformed into reusable knowledge. To address these challenges, we propose FailureMem, a multimodal repair framework that integrates three key mechanisms: a hybrid workflow-agent architecture that balances structured localization with flexible reasoning, active perception tools that enable region-level visual grounding, and a Failure Memory Bank that converts past repair attempts into reusable guidance. Experiments on SWE-bench Multimodal demonstrate FailureMem improves the resolved rate over GUIRepair by 3.7%.

标签

自动软件修复 多模态学习 LLM Agent Failure Memory

arXiv 分类

cs.SE cs.AI