GraphWalker: Agentic Knowledge Graph Question Answering via Synthetic Trajectory Curriculum
AI 摘要
GraphWalker通过自动轨迹合成和阶段性微调,提升了Agent在知识图谱问答中的推理泛化能力。
主要贡献
- 提出了Automated Trajectory Synthesis方法,生成多样化探索轨迹
- 提出了Stage-wise Fine-tuning策略,增强Agent的反射和纠错能力
- GraphWalker在多个KGQA数据集上取得了SOTA性能,并提升了OOD泛化能力
方法论
使用约束随机游走生成结构多样的轨迹,作为第一阶段SFT训练数据。第二阶段SFT使用少量专家轨迹进行微调,之后进行RL。
原文摘要
Agentic knowledge graph question answering (KGQA) requires an agent to iteratively interact with knowledge graphs (KGs), posing challenges in both training data scarcity and reasoning generalization. Specifically, existing approaches often restrict agent exploration: prompting-based methods lack autonomous navigation training, while current training pipelines usually confine reasoning to predefined trajectories. To this end, this paper proposes \textit{GraphWalker}, a novel agentic KGQA framework that addresses these challenges through \textit{Automated Trajectory Synthesis} and \textit{Stage-wise Fine-tuning}. GraphWalker adopts a two-stage SFT training paradigm: First, the agent is trained on structurally diverse trajectories synthesized from constrained random-walk paths, establishing a broad exploration prior over the KG; Second, the agent is further fine-tuned on a small set of expert trajectories to develop reflection and error recovery capabilities. Extensive experiments demonstrate that our stage-wise SFT paradigm unlocks a higher performance ceiling for a lightweight reinforcement learning (RL) stage, enabling GraphWalker to achieve state-of-the-art performance on CWQ and WebQSP. Additional results on GrailQA and our constructed GraphWalkerBench confirm that GraphWalker enhances generalization to out-of-distribution reasoning paths. The code is publicly available at https://github.com/XuShuwenn/GraphWalker