CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents
AI 摘要
CLAG提出了一种基于聚类的SLM Agent记忆框架,通过Agent主动组织记忆,提高检索效率和知识密度。
主要贡献
- 提出了基于聚类的Agentic记忆框架CLAG
- 使用SLM驱动的路由器进行记忆聚类和 профилирование
- 实验证明CLAG能有效提高SLM Agent在QA任务中的性能
方法论
使用SLM Agent驱动的聚类方法,将记忆组织成语义连贯的簇,并为每个簇生成 профиль,进行两阶段检索。
原文摘要
Large language model agents heavily rely on external memory to support knowledge reuse and complex reasoning tasks. Yet most memory systems store experiences in a single global retrieval pool which can gradually dilute or corrupt stored knowledge. This problem is especially pronounced for small language models (SLMs), which are highly vulnerable to irrelevant context. We introduce CLAG, a CLustering-based AGentic memory framework where an SLM agent actively organizes memory by clustering. CLAG employs an SLM-driven router to assign incoming memories to semantically coherent clusters and autonomously generates cluster-specific profiles, including topic summaries and descriptive tags, to establish each cluster as a self-contained functional unit. By performing localized evolution within these structured neighborhoods, CLAG effectively reduces cross-topic interference and enhances internal memory density. During retrieval, the framework utilizes a two-stage process that first filters relevant clusters via their profiles, thereby excluding distractors and reducing the search space. Experiments on multiple QA datasets with three SLM backbones show that CLAG consistently improves answer quality and robustness over prior memory systems for agents, remaining lightweight and efficient.