Diagnosing and Repairing Citation Failures in Generative Engine Optimization
AI 摘要
提出AgentGEO框架,诊断并修复生成引擎优化中的引用失败问题,显著提升引用率。
主要贡献
- 提出了引用失败模式的分类体系
- 构建了AgentGEO代理系统,用于诊断和修复引用失败
- 创建了文档中心基准,用于评估优化的泛化能力
方法论
构建AgentGEO,利用诊断分类体系,从工具库中选择修复策略,迭代优化直到成功引用。
原文摘要
Generative Engine Optimization (GEO) aims to improve content visibility in AI-generated responses. However, existing methods measure contribution-how much a document influences a response-rather than citation, the mechanism that actually drives traffic back to creators. Also, these methods apply generic rewriting rules uniformly, failing to diagnose why individual document are not cited. This paper introduces a diagnostic approach to GEO that asks why a document fails to be cited and intervenes accordingly. We develop a unified framework comprising: (1) the first taxonomy of citation failure modes spanning different stages of a citation pipeline; (2) AgentGEO, an agentic system that diagnoses failures using this taxonomy, selects targeted repairs from a corresponding tool library, and iterates until citation is achieved; and (3) a document-centric benchmark evaluating whether optimizations generalize across held-out queries. AgentGEO achieves over 40% relative improvement in citation rates while modifying only 5% of content, compared to 25% for baselines. Our analysis reveals that generic optimization can harm long-tail content and some documents face challenges that optimization alone cannot fully address-findings with implications for equitable visibility in AI-mediated information access.