AI Agents 相关度: 9/10

Probe-then-Plan: Environment-Aware Planning for Industrial E-commerce Search

Mengxiang Chen, Zhouwei Zhai, Jin Li
arXiv: 2603.15262v1 发布: 2026-03-16 更新: 2026-03-16

AI 摘要

提出EASP框架,通过检索探测指导LLM搜索规划,解决电商搜索中效率与效果的平衡问题。

主要贡献

  • 提出Probe-then-Plan机制,将环境信息融入搜索规划。
  • 设计离线数据合成和在线自适应服务流程。
  • 在JD.com的电商搜索系统上成功部署并验证了EASP的有效性。

方法论

构建教师Agent合成数据,通过SFT初始化Planner,再用强化学习对齐业务目标,最后自适应在线服务。

原文摘要

Modern e-commerce search is evolving to resolve complex user intents. While Large Language Models (LLMs) offer strong reasoning, existing LLM-based paradigms face a fundamental blindness-latency dilemma: query rewriting is agnostic to retrieval capabilities and real-time inventory, yielding invalid plans; conversely, deep search agents rely on iterative tool calls and reflection, incurring seconds of latency incompatible with industrial sub-second budgets. To resolve this conflict, we propose Environment-Aware Search Planning (EASP), reformulating search planning as a dynamic reasoning process grounded in environmental reality. EASP introduces a Probe-then-Plan mechanism: a lightweight Retrieval Probe exposes the retrieval snapshot, enabling the Planner to diagnose execution gaps and generate grounded search plans. The methodology comprises three stages: (1) Offline Data Synthesis: A Teacher Agent synthesizes diverse, execution-validated plans by diagnosing the probed environment. (2) Planner Training and Alignment: The Planner is initialized via Supervised Fine-Tuning (SFT) to internalize diagnostic capabilities, then aligned with business outcomes (conversion rate) via Reinforcement Learning (RL). (3) Adaptive Online Serving: A complexity-aware routing mechanism selectively activates planning for complex queries, ensuring optimal resource allocation. Extensive offline evaluations and online A/B testing on JD.com demonstrate that EASP significantly improves relevant recall and achieves substantial lifts in UCVR and GMV. EASP has been successfully deployed in JD.com's AI-Search system.

标签

电商搜索 LLM 搜索规划 强化学习

arXiv 分类

cs.AI