LLM Memory & RAG 相关度: 9/10

Welfarist Formulations for Diverse Similarity Search

Siddharth Barman, Nirjhar Das, Shivam Gupta, Kirankumar Shiragur
arXiv: 2602.08742v1 发布: 2026-02-09 更新: 2026-02-09

AI 摘要

提出了基于福利函数的近邻搜索算法,提升检索结果的多样性,并兼顾相关性。

主要贡献

  • 提出了基于福利函数的多样性近邻搜索目标函数
  • 设计了高效的算法,可与现有ANN方法结合
  • 实验证明算法有效提升多样性并保持相关性

方法论

基于福利函数(特别是Nash社会福利)构建目标函数,并设计算法优化该目标函数。

原文摘要

Nearest Neighbor Search (NNS) is a fundamental problem in data structures with wide-ranging applications, such as web search, recommendation systems, and, more recently, retrieval-augmented generations (RAG). In such recent applications, in addition to the relevance (similarity) of the returned neighbors, diversity among the neighbors is a central requirement. In this paper, we develop principled welfare-based formulations in NNS for realizing diversity across attributes. Our formulations are based on welfare functions -- from mathematical economics -- that satisfy central diversity (fairness) and relevance (economic efficiency) axioms. With a particular focus on Nash social welfare, we note that our welfare-based formulations provide objective functions that adaptively balance relevance and diversity in a query-dependent manner. Notably, such a balance was not present in the prior constraint-based approach, which forced a fixed level of diversity and optimized for relevance. In addition, our formulation provides a parametric way to control the trade-off between relevance and diversity, providing practitioners with flexibility to tailor search results to task-specific requirements. We develop efficient nearest neighbor algorithms with provable guarantees for the welfare-based objectives. Notably, our algorithm can be applied on top of any standard ANN method (i.e., use standard ANN method as a subroutine) to efficiently find neighbors that approximately maximize our welfare-based objectives. Experimental results demonstrate that our approach is practical and substantially improves diversity while maintaining high relevance of the retrieved neighbors.

标签

近邻搜索 多样性 福利函数 推荐系统 RAG

arXiv 分类

cs.DS cs.CG cs.GT cs.IR cs.LG