Best-Arm Identification with Noisy Actuation
arXiv: 2604.02255v1
发布: 2026-04-02
更新: 2026-04-02
AI 摘要
研究在有噪声信道下,如何通过通信策略在多臂老虎机问题中识别最佳臂。
主要贡献
- 提出适用于不同agent能力的通信方案
- 分析通信方案与信道零错误容量的关系
- 研究噪声环境下的最佳臂识别
方法论
分析多臂老虎机在离散无记忆信道下的通信策略,结合agent能力进行优化。
原文摘要
In this paper, we consider a multi-armed bandit (MAB) instance and study how to identify the best arm when arm commands are conveyed from a central learner to a distributed agent over a discrete memoryless channel (DMC). Depending on the agent capabilities, we provide communication schemes along with their analysis, which interestingly relate to the zero-error capacity of the underlying DMC.