Resource-constrained Amazons chess decision framework integrating large language models and graph attention
AI 摘要
结合图注意力网络和LLM,在资源约束下实现Amazons棋高性能决策。
主要贡献
- 提出轻量级混合框架,结合图结构推理和LLM生成能力
- 利用图注意力自编码器指导多步蒙特卡洛树搜索
- 使用随机图遗传算法优化评估信号,利用GPT-4o-mini生成合成训练数据
方法论
结合图注意力自编码器、蒙特卡洛树搜索、随机图遗传算法和GPT-4o-mini,实现弱监督学习。
原文摘要
Artificial intelligence has advanced significantly through the development of intelligent game-playing systems, providing rigorous testbeds for decision-making, strategic planning, and adaptive learning. However, resource-constrained environments pose critical challenges, as conventional deep learning methods heavily rely on extensive datasets and computational resources. In this paper, we propose a lightweight hybrid framework for the Game of the Amazons, which explores the paradigm of weak-to-strong generalization by integrating the structural reasoning of graph-based learning with the generative capabilities of large language models. Specifically, we leverage a Graph Attention Autoencoder to inform a multi-step Monte Carlo Tree Search, utilize a Stochastic Graph Genetic Algorithm to optimize evaluation signals, and harness GPT-4o-mini to generate synthetic training data. Unlike traditional approaches that rely on expert demonstrations, our framework learns from noisy and imperfect supervision. We demonstrate that the Graph Attention mechanism effectively functions as a structural filter, denoising the LLM's outputs. Experiments on a 10$\times$10 Amazons board show that our hybrid approach not only achieves a 15\%--56\% improvement in decision accuracy over baselines but also significantly outperforms its teacher model (GPT-4o-mini), achieving a competitive win rate of 45.0\% at N=30 nodes and a decisive 66.5\% at only N=50 nodes. These results verify the feasibility of evolving specialized, high-performance game AI from general-purpose foundation models under stringent computational constraints.