Agent Tuning & Optimization 相关度: 9/10

Memento-Skills: Let Agents Design Agents

Huichi Zhou, Siyuan Guo, Anjie Liu, Zhongwei Yu, Ziqin Gong, Bowen Zhao, Zhixun Chen, Menglong Zhang, Yihang Chen, Jinsong Li, Runyu Yang, Qiangbin Liu, Xinlei Yu, Jianmin Zhou, Na Wang, Chunyang Sun, Jun Wang
arXiv: 2603.18743v1 发布: 2026-03-19 更新: 2026-03-19

AI 摘要

Memento-Skills构建了一个通过经验自主设计和改进agent的通用可持续学习LLM agent系统。

主要贡献

  • 提出了Memento-Skills,一个agent-designing agent系统。
  • 引入了基于记忆的强化学习框架,使用状态提示和可重用技能作为持续演进的记忆。
  • 开发了Read--Write Reflective Learning机制,实现无需更新LLM参数的持续学习。

方法论

Memento-Skills通过读写反思学习机制,不断更新和扩展技能库,实现agent的持续改进,无需更新LLM参数。

原文摘要

We introduce \emph{Memento-Skills}, a generalist, continually-learnable LLM agent system that functions as an \emph{agent-designing agent}: it autonomously constructs, adapts, and improves task-specific agents through experience. The system is built on a memory-based reinforcement learning framework with \emph{stateful prompts}, where reusable skills (stored as structured markdown files) serve as persistent, evolving memory. These skills encode both behaviour and context, enabling the agent to carry forward knowledge across interactions. Starting from simple elementary skills (like Web search and terminal operations), the agent continually improves via the \emph{Read--Write Reflective Learning} mechanism introduced in \emph{Memento~2}~\cite{wang2025memento2}. In the \emph{read} phase, a behaviour-trainable skill router selects the most relevant skill conditioned on the current stateful prompt; in the \emph{write} phase, the agent updates and expands its skill library based on new experience. This closed-loop design enables \emph{continual learning without updating LLM parameters}, as all adaptation is realised through the evolution of externalised skills and prompts. Unlike prior approaches that rely on human-designed agents, Memento-Skills enables a generalist agent to \emph{design agents end-to-end} for new tasks. Through iterative skill generation and refinement, the system progressively improves its own capabilities. Experiments on the \emph{General AI Assistants} benchmark and \emph{Humanity's Last Exam} demonstrate sustained gains, achieving 26.2\% and 116.2\% relative improvements in overall accuracy, respectively. Code is available at https://github.com/Memento-Teams/Memento-Skills.

标签

AI Agent Continual Learning Memory Skill Learning

arXiv 分类

cs.AI cs.CL cs.LG