LLM Memory & RAG 相关度: 9/10

TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA

Mengwei Yuan, Jianan Liu, Jing Yang, Xianyou Li, Weiran Yan, Yichao Wu, Penghao Liang
arXiv: 2603.09297v1 发布: 2026-03-10 更新: 2026-03-10

AI 摘要

提出TA-Mem框架,通过工具增强的自主记忆检索,提升LLM在长程对话问答中的表现。

主要贡献

  • 提出了一种工具增强的自主记忆检索框架TA-Mem
  • 设计了基于语义相关的自适应分块和结构化信息提取的记忆提取LLM Agent
  • 构建了多索引记忆数据库,支持基于键值和相似度的检索
  • 实现了工具增强的记忆检索Agent,能根据用户输入自主选择工具进行检索和推理

方法论

利用记忆提取Agent将输入分块并提取信息,存储在多索引数据库中,检索Agent通过工具自主探索记忆并推理。

原文摘要

Large Language Model (LLM) has exhibited strong reasoning ability in text-based contexts across various domains, yet the limitation of context window poses challenges for the model on long-range inference tasks and necessitates a memory storage system. While many current storage approaches have been proposed with episodic notes and graph representations of memory, retrieval methods still primarily rely on predefined workflows or static similarity top-k over embeddings. To address this inflexibility, we introduced a novel tool-augmented autonomous memory retrieval framework (TA-Mem), which contains: (1) a memory extraction LLM agent which is prompted to adaptively chuck an input into sub-context based on semantic correlation, and extract information into structured notes, (2) a multi-indexed memory database designed for different types of query methods including both key-based lookup and similarity-based retrieval, (3) a tool-augmented memory retrieval agent which explores the memory autonomously by selecting appropriate tools provided by the database based on the user input, and decides whether to proceed to the next iteration or finalizing the response after reasoning on the fetched memories. The TA-Mem is evaluated on the LoCoMo dataset, achieving significant performance improvements over existing baseline approaches. In addition, an analysis of tool use across different question types also demonstrates the adaptivity of the proposed method.

标签

LLM Memory Retrieval Tool-Augmented Agent Long-Term Conversation Question Answering

arXiv 分类

cs.IR cs.CL