AI Agents 相关度: 9/10

Verifiable Semantics for Agent-to-Agent Communication

Philipp Schoenegger, Matt Carlson, Chris Schneider, Chris Daly
arXiv: 2602.16424v1 发布: 2026-02-18 更新: 2026-02-18

AI 摘要

提出了一种可验证的多智能体通信框架,降低语义分歧,提升一致性。

主要贡献

  • 提出基于刺激-意义模型(stimulus-meaning model)的认证协议。
  • 核心保护推理(core-guarded reasoning)可证明地限制分歧。
  • 提出语义漂移检测和词汇恢复机制。

方法论

通过共享可观察事件测试智能体,使用统计阈值认证术语,限制智能体使用认证术语进行推理。

原文摘要

Multiagent AI systems require consistent communication, but we lack methods to verify that agents share the same understanding of the terms used. Natural language is interpretable but vulnerable to semantic drift, while learned protocols are efficient but opaque. We propose a certification protocol based on the stimulus-meaning model, where agents are tested on shared observable events and terms are certified if empirical disagreement falls below a statistical threshold. In this protocol, agents restricting their reasoning to certified terms ("core-guarded reasoning") achieve provably bounded disagreement. We also outline mechanisms for detecting drift (recertification) and recovering shared vocabulary (renegotiation). In simulations with varying degrees of semantic divergence, core-guarding reduces disagreement by 72-96%. In a validation with fine-tuned language models, disagreement is reduced by 51%. Our framework provides a first step towards verifiable agent-to-agent communication.

标签

multi-agent systems communication protocols verifiable semantics semantic drift

arXiv 分类

cs.AI cs.MA