LLM Memory & RAG 相关度: 9/10

Legal-DC: Benchmarking Retrieval-Augmented Generation for Legal Documents

Yaocong Li, Qiang Lan, Leihan Zhang, Le Zhang
arXiv: 2603.11772v1 发布: 2026-03-12 更新: 2026-03-12

AI 摘要

构建中文法律RAG基准Legal-DC,提出LegRAG框架,提升法律检索与生成性能。

主要贡献

  • 构建Legal-DC中文法律RAG基准数据集
  • 提出LegRAG框架,结合法律自适应索引和双路径自反思机制
  • 提出针对法律检索场景的自动评估方法

方法论

构建数据集,提出包含法律自适应索引和双路径自反思的LegRAG框架,并进行实验评估。

原文摘要

Retrieval-Augmented Generation (RAG) has emerged as a promising technology for legal document consultation, yet its application in Chinese legal scenarios faces two key limitations: existing benchmarks lack specialized support for joint retriever-generator evaluation, and mainstream RAG systems often fail to accommodate the structured nature of legal provisions. To address these gaps, this study advances two core contributions: First, we constructed the Legal-DC benchmark dataset, comprising 480 legal documents (covering areas such as market regulation and contract management) and 2,475 refined question-answer pairs, each annotated with clause-level references, filling the gap for specialized evaluation resources in Chinese legal RAG. Second, we propose the LegRAG framework, which integrates legal adaptive indexing (clause-boundary segmentation) with a dual-path self-reflection mechanism to ensure clause integrity while enhancing answer accuracy. Third, we introduce automated evaluation methods for large language models to meet the high-reliability demands of legal retrieval scenarios. LegRAG outperforms existing state-of-the-art methods by 1.3% to 5.6% across key evaluation metrics. This research provides a specialized benchmark, practical framework, and empirical insights to advance the development of Chinese legal RAG systems. Our code and data are available at https://github.com/legal-dc/Legal-DC.

标签

RAG 法律 检索增强生成 中文

arXiv 分类

cs.CL