AI Agents 相关度: 9/10

CommCP: Efficient Multi-Agent Coordination via LLM-Based Communication with Conformal Prediction

Xiaopan Zhang, Zejin Wang, Zhixu Li, Jianpeng Yao, Jiachen Li
arXiv: 2602.06038v1 发布: 2026-02-05 更新: 2026-02-05

AI 摘要

论文提出了CommCP框架,利用LLM和一致性预测解决多智能体多任务具身问答中的通信协作问题。

主要贡献

  • 提出了多智能体多任务具身问答 (MM-EQA) 问题
  • 设计了基于LLM和一致性预测的去中心化通信框架CommCP
  • 构建了包含真实家庭场景的MM-EQA基准数据集

方法论

使用LLM生成消息,通过一致性预测校准消息,减少接收者干扰,提高通信可靠性,提升任务成功率和探索效率。

原文摘要

To complete assignments provided by humans in natural language, robots must interpret commands, generate and answer relevant questions for scene understanding, and manipulate target objects. Real-world deployments often require multiple heterogeneous robots with different manipulation capabilities to handle different assignments cooperatively. Beyond the need for specialized manipulation skills, effective information gathering is important in completing these assignments. To address this component of the problem, we formalize the information-gathering process in a fully cooperative setting as an underexplored multi-agent multi-task Embodied Question Answering (MM-EQA) problem, which is a novel extension of canonical Embodied Question Answering (EQA), where effective communication is crucial for coordinating efforts without redundancy. To address this problem, we propose CommCP, a novel LLM-based decentralized communication framework designed for MM-EQA. Our framework employs conformal prediction to calibrate the generated messages, thereby minimizing receiver distractions and enhancing communication reliability. To evaluate our framework, we introduce an MM-EQA benchmark featuring diverse, photo-realistic household scenarios with embodied questions. Experimental results demonstrate that CommCP significantly enhances the task success rate and exploration efficiency over baselines. The experiment videos, code, and dataset are available on our project website: https://comm-cp.github.io.

标签

多智能体 LLM 具身问答 通信 一致性预测

arXiv 分类

cs.RO cs.AI cs.CV cs.LG cs.MA