AI Agents 相关度: 9/10

MA-CoNav: A Master-Slave Multi-Agent Framework with Hierarchical Collaboration and Dual-Level Reflection for Long-Horizon Embodied VLN

Ling Luo, Qianqian Bai
arXiv: 2603.03024v1 发布: 2026-03-03 更新: 2026-03-03

AI 摘要

MA-CoNav是一个多智能体协作框架,用于解决复杂视觉语言导航中的感知和决策问题。

主要贡献

  • 提出了 Master-Slave 多智能体协作架构
  • 引入了 Local-Global 双重反射机制
  • 在真实机器人数据集上验证了框架的有效性

方法论

采用分层协作架构,将导航任务分解到不同智能体,利用双重反射机制优化导航流程。

原文摘要

Vision-Language Navigation (VLN) aims to empower robots with the ability to perform long-horizon navigation in unfamiliar environments based on complex linguistic instructions. Its success critically hinges on establishing an efficient ``language-understanding -- visual-perception -- embodied-execution'' closed loop. Existing methods often suffer from perceptual distortion and decision drift in complex, long-distance tasks due to the cognitive overload of a single agent. Inspired by distributed cognition theory, this paper proposes MA-CoNav, a Multi-Agent Collaborative Navigation framework. This framework adopts a ``Master-Slave'' hierarchical agent collaboration architecture, decoupling and distributing the perception, planning, execution, and memory functions required for navigation tasks to specialized agents. Specifically, the Master Agent is responsible for global orchestration, while the Subordinate Agent group collaborates through a clear division of labor: an Observation Agent generates environment descriptions, a Planning Agent performs task decomposition and dynamic verification, an Execution Agent handles simultaneous mapping and action, and a Memory Agent manages structured experiences. Furthermore, the framework introduces a ``Local-Global'' dual-stage reflection mechanism to dynamically optimize the entire navigation pipeline. Empirical experiments were conducted using a real-world indoor dataset collected by a Limo Pro robot, with no scene-specific fine-tuning performed on the models throughout the process. The results demonstrate that MA-CoNav comprehensively outperforms existing mainstream VLN methods across multiple metrics.

标签

VLN Multi-Agent Robot Navigation Hierarchical Collaboration

arXiv 分类

cs.RO cs.AI