AI Agents 相关度: 7/10

Trust-Aware Routing for Distributed Generative AI Inference at the Edge

Chanh Nguyen, Erik Elmroth

arXiv: 2603.28622v1 发布: 2026-03-30 更新: 2026-03-30

下载 PDF arXiv 页面

AI 摘要

G-TRAC通过信任感知路由提高边缘分布式生成AI推理的鲁棒性和可靠性。

主要贡献

提出了一种信任感知的分布式推理协调框架G-TRAC
设计了基于风险约束最短路径的路由算法，实现低延迟的路径选择
提出了混合信任架构，维护全局声誉状态并支持边缘节点的轻量级更新

方法论

结合信任度进行风险评估，使用Dijkstra算法优化路由，并采用混合信任架构动态更新信任信息。

原文摘要

Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous edge devices rather than on a single trusted server. In such environments, a single device failure or misbehavior can disrupt the entire inference process, making traditional best-effort peer-to-peer routing insufficient. Coordinating distributed generative inference therefore requires mechanisms that explicitly account for reliability, performance variability, and trust among participating peers. In this paper, we present G-TRAC, a trust-aware coordination framework that integrates algorithmic path selection with system-level protocol design to ensure robust distributed inference. First, we formulate the routing problem as a \textit{Risk-Bounded Shortest Path} computation and introduce a polynomial-time solution that combines trust-floor pruning with Dijkstra's search, achieving sub-millisecond median routing latency at practical edge scales, and remaining below 10 ms at larger scales. Second, to operationally support the routing logic in dynamic environments, the framework employs a \textit{Hybrid Trust Architecture} that maintains global reputation state at stable anchors while disseminating lightweight updates to edge peers via background synchronization. Experimental evaluation on a heterogeneous testbed of commodity devices demonstrates that G-TRAC significantly improves inference completion rates, effectively isolates unreliable peers, and sustains robust execution even under node failures and network partitions.

arXiv 分类

cs.DC cs.AI cs.NI

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类