AI Agents 相关度: 7/10

Generalized Rapid Action Value Estimation in Memory-Constrained Environments

Aloïs Rautureau, Tristan Cazenave, Éric Piette
arXiv: 2602.23318v1 发布: 2026-02-26 更新: 2026-02-26

AI 摘要

提出GRAVE2、GRAVER、GRAVER2算法,减少内存占用,保持GRAVE的博弈强度。

主要贡献

  • 提出了GRAVE2算法,通过两层搜索扩展GRAVE。
  • 提出了GRAVER算法,利用节点回收减少内存占用。
  • 提出了GRAVER2算法,结合两层搜索和节点回收。
  • 实验证明新算法在降低内存占用的同时,保持了GRAVE的博弈强度。

方法论

通过两层搜索、节点回收以及二者结合的策略,在内存受限的环境下改进GRAVE算法。

原文摘要

Generalized Rapid Action Value Estimation (GRAVE) has been shown to be a strong variant within the Monte-Carlo Tree Search (MCTS) family of algorithms for General Game Playing (GGP). However, its reliance on storing additional win/visit statistics at each node makes its use impractical in memory-constrained environments, thereby limiting its applicability in practice. In this paper, we introduce the GRAVE2, GRAVER and GRAVER2 algorithms, which extend GRAVE through two-level search, node recycling, and a combination of both techniques, respectively. We show that these enhancements enable a drastic reduction in the number of stored nodes while matching the playing strength of GRAVE.

标签

GGP MCTS GRAVE 内存优化

arXiv 分类

cs.AI