AI Agents 相关度: 8/10

Resource-constrained Amazons chess decision framework integrating large language models and graph attention

Tianhao Qian, Zhuoxuan Li, Jinde Cao, Xinli Shi, Hanjie Liu, Leszek Rutkowski

arXiv: 2603.10512v1 发布: 2026-03-11 更新: 2026-03-11

下载 PDF arXiv 页面

AI 摘要

结合图注意力网络和LLM，在资源约束下实现Amazons棋高性能决策。

主要贡献

提出轻量级混合框架，结合图结构推理和LLM生成能力
利用图注意力自编码器指导多步蒙特卡洛树搜索
使用随机图遗传算法优化评估信号，利用GPT-4o-mini生成合成训练数据

方法论

结合图注意力自编码器、蒙特卡洛树搜索、随机图遗传算法和GPT-4o-mini，实现弱监督学习。

原文摘要

Artificial intelligence has advanced significantly through the development of intelligent game-playing systems, providing rigorous testbeds for decision-making, strategic planning, and adaptive learning. However, resource-constrained environments pose critical challenges, as conventional deep learning methods heavily rely on extensive datasets and computational resources. In this paper, we propose a lightweight hybrid framework for the Game of the Amazons, which explores the paradigm of weak-to-strong generalization by integrating the structural reasoning of graph-based learning with the generative capabilities of large language models. Specifically, we leverage a Graph Attention Autoencoder to inform a multi-step Monte Carlo Tree Search, utilize a Stochastic Graph Genetic Algorithm to optimize evaluation signals, and harness GPT-4o-mini to generate synthetic training data. Unlike traditional approaches that rely on expert demonstrations, our framework learns from noisy and imperfect supervision. We demonstrate that the Graph Attention mechanism effectively functions as a structural filter, denoising the LLM's outputs. Experiments on a 10$\times$10 Amazons board show that our hybrid approach not only achieves a 15\%--56\% improvement in decision accuracy over baselines but also significantly outperforms its teacher model (GPT-4o-mini), achieving a competitive win rate of 45.0\% at N=30 nodes and a decisive 66.5\% at only N=50 nodes. These results verify the feasibility of evolving specialized, high-performance game AI from general-purpose foundation models under stringent computational constraints.

arXiv 分类

cs.AI cs.LG cs.NE

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类