LLM Memory & RAG 相关度: 9/10

D-Mem: A Dual-Process Memory System for LLM Agents

Zhixing You, Jiachen Yuan, Jason Cai
arXiv: 2603.18631v1 发布: 2026-03-19 更新: 2026-03-19

AI 摘要

D-Mem是一种双过程记忆系统,通过质量门控策略平衡效率与准确性,提升LLM Agent的长期推理能力。

主要贡献

  • 提出了D-Mem双过程记忆系统
  • 设计了多维质量门控策略
  • 实验证明D-Mem在效率和性能上的优势

方法论

D-Mem结合轻量级向量检索和详尽的Full Deliberation模块,通过质量门控动态选择,实现认知经济与高保真记忆的平衡。

原文摘要

Driven by the development of persistent, self-adapting autonomous agents, equipping these systems with high-fidelity memory access for long-horizon reasoning has emerged as a critical requirement. However, prevalent retrieval-based memory frameworks often follow an incremental processing paradigm that continuously extracts and updates conversational memories into vector databases, relying on semantic retrieval when queried. While this approach is fast, it inherently relies on lossy abstraction, frequently missing contextually critical information and struggling to resolve queries that rely on fine-grained contextual understanding. To address this, we introduce D-Mem, a dual-process memory system. It retains lightweight vector retrieval for routine queries while establishing an exhaustive Full Deliberation module as a high-fidelity fallback. To achieve cognitive economy without sacrificing accuracy, D-Mem employs a Multi-dimensional Quality Gating policy to dynamically bridge these two processes. Experiments on the LoCoMo and RealTalk benchmarks using GPT-4o-mini and Qwen3-235B-Instruct demonstrate the efficacy of our approach. Notably, our Multi-dimensional Quality Gating policy achieves an F1 score of 53.5 on LoCoMo with GPT-4o-mini. This outperforms our static retrieval baseline, Mem0$^\ast$ (51.2), and recovers 96.7\% of the Full Deliberation's performance (55.3), while incurring significantly lower computational costs.

标签

LLM Agent Memory Retrieval Reasoning

arXiv 分类

cs.AI