AI Agents 相关度: 7/10

Optimal Rates for Feasible Payoff Set Estimation in Games

Annalisa Barbara, Riccardo Poiani, Martino Bernasconi, Andrea Celli
arXiv: 2602.04397v1 发布: 2026-02-04 更新: 2026-02-04

AI 摘要

研究逆向博弈论中可行收益集合估计的最优速率,并提供理论基础。

主要贡献

  • 提出零和及一般和博弈中精确和近似均衡博弈的最优最小最大速率
  • 为多智能体环境中的集合值收益推断提供学习理论基础
  • 研究了在仅观察玩家行为时推断玩家收益函数的难题

方法论

通过学习理论方法,推导并证明了在 Hausdorff 距离下可行收益集合估计的 minimax 最优速率。

原文摘要

We study a setting in which two players play a (possibly approximate) Nash equilibrium of a bimatrix game, while a learner observes only their actions and has no knowledge of the equilibrium or the underlying game. A natural question is whether the learner can rationalize the observed behavior by inferring the players' payoff functions. Rather than producing a single payoff estimate, inverse game theory aims to identify the entire set of payoffs consistent with observed behavior, enabling downstream use in, e.g., counterfactual analysis and mechanism design across applications like auctions, pricing, and security games. We focus on the problem of estimating the set of feasible payoffs with high probability and up to precision $ε$ on the Hausdorff metric. We provide the first minimax-optimal rates for both exact and approximate equilibrium play, in zero-sum as well as general-sum games. Our results provide learning-theoretic foundations for set-valued payoff inference in multi-agent environments.

标签

逆向博弈论 可行收益集合估计 学习理论 博弈论

arXiv 分类

cs.GT cs.LG