Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair
AI 摘要
Polaris通过经验抽象进行策略修复,提升小语言模型的递归自改进能力。
主要贡献
- 提出了Polaris框架,用于小语言模型的Gödel Agent。
- 引入经验抽象,将失败转化为可复用的策略。
- 通过策略级别的修改而非参数调整,实现持久改进。
方法论
Polaris通过分析、策略形成、抽象和代码修复的循环,实现策略修复和改进,包含元推理过程。
原文摘要
Gödel agent realize recursive self-improvement: an agent inspects its own policy and traces and then modifies that policy in a tested loop. We introduce Polaris, a Gödel agent for compact models that performs policy repair via experience abstraction, turning failures into policy updates through a structured cycle of analysis, strategy formation, abstraction, and minimal code pat ch repair with conservative checks. Unlike response level self correction or parameter tuning, Polaris makes policy level changes with small, auditable patches that persist in the policy and are reused on unseen instances within each benchmark. As part of the loop, the agent engages in meta reasoning: it explains its errors, proposes concrete revisions to its own policy, and then updates the policy. To enable cumulative policy refinement, we introduce experience abstraction, which distills failures into compact, reusable strategies that transfer to unseen instances. On MGSM, DROP, GPQA, and LitBench (covering arithmetic reasoning, compositional inference, graduate-level problem solving, and creative writing evaluation), a 7-billion-parameter model equipped with Polaris achieves consistent gains over the base policy and competitive baselines.