LLM Memory & RAG 相关度: 9/10

With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots

Zeinab Sadat Taghavi, Ali Modarressi, Hinrich Schutze, Andreas Marfurt
arXiv: 2602.09616v1 发布: 2026-02-10 更新: 2026-02-10

AI 摘要

该论文提出ARGUS方法,通过预先识别并修复检索盲点来提升RAG系统的检索效果。

主要贡献

  • 发现RAG系统中神经检索器的检索盲点
  • 提出Retrieval Probability Score (RPS)用于预测检索盲点
  • 设计ARGUS流程通过知识库增强来解决检索盲点

方法论

通过Wikidata构建大规模数据集,使用RPS预测检索盲点,并利用知识库进行文档增强,最终在多个数据集上验证了ARGUS的有效性。

原文摘要

Reliable retrieval-augmented generation (RAG) systems depend fundamentally on the retriever's ability to find relevant information. We show that neural retrievers used in RAG systems have blind spots, which we define as the failure to retrieve entities that are relevant to the query, but have low similarity to the query embedding. We investigate the training-induced biases that cause such blind spot entities to be mapped to inaccessible parts of the embedding space, resulting in low retrievability. Using a large-scale dataset constructed from Wikidata relations and first paragraphs of Wikipedia, and our proposed Retrieval Probability Score (RPS), we show that blind spot risk in standard retrievers (e.g., CONTRIEVER, REASONIR) can be predicted pre-index from entity embedding geometry, avoiding expensive retrieval evaluations. To address these blind spots, we introduce ARGUS, a pipeline that enables the retrievability of high-risk (low-RPS) entities through targeted document augmentation from a knowledge base (KB), first paragraphs of Wikipedia, in our case. Extensive experiments on BRIGHT, IMPLIRET, and RAR-B show that ARGUS achieves consistent improvements across all evaluated retrievers (averaging +3.4 nDCG@5 and +4.5 nDCG@10 absolute points), with substantially larger gains in challenging subsets. These results establish that preemptively remedying blind spots is critical for building robust and trustworthy RAG systems (Code and Data).

标签

RAG 检索增强 检索盲点 知识库

arXiv 分类

cs.IR cs.AI