LLM Memory & RAG 相关度: 9/10

Improving Neural Retrieval with Attribution-Guided Query Rewriting

Moncef Garouani, Josiane Mothe
arXiv: 2602.11841v1 发布: 2026-02-12 更新: 2026-02-12

AI 摘要

提出一种基于Token级别归因的查询重写方法,提升神经检索器的性能。

主要贡献

  • 利用检索器反馈指导查询重写
  • 使用token-level归因引导LLM生成更清晰的查询
  • 在BEIR数据集上验证了有效性

方法论

计算检索器的梯度token归因,作为LLM提示的软指导,重写查询。

原文摘要

Neural retrievers are effective but brittle: underspecified or ambiguous queries can misdirect ranking even when relevant documents exist. Existing approaches address this brittleness only partially: LLMs rewrite queries without retriever feedback, and explainability methods identify misleading tokens but are used for post-hoc analysis. We close this loop and propose an attribution-guided query rewriting method that uses token-level explanations to guide query rewriting. For each query, we compute gradient-based token attributions from the retriever and then use these scores as soft guidance in a structured prompt to an LLM that clarifies weak or misleading query components while preserving intent. Evaluated on BEIR collections, the resulting rewrites consistently improve retrieval effectiveness over strong baselines, with larger gains for implicit or ambiguous information needs.

标签

神经检索 查询重写 可解释性 LLM

arXiv 分类

cs.IR cs.AI cs.LG