Multi-Agent Video Recommenders: Evolution, Patterns, and Open Challenges
AI 摘要
综述了多智能体视频推荐系统的演进、模式、挑战和未来方向,重点关注LLM驱动的架构。
主要贡献
- 总结了多智能体视频推荐系统的发展历程
- 提出了多智能体协作模式的分类
- 探讨了该领域的开放性挑战和未来研究方向
方法论
通过文献综述,分析了多智能体推荐系统、基础模型和对话式AI的相关研究,并提出了新的分类体系。
原文摘要
Video recommender systems are among the most popular and impactful applications of AI, shaping content consumption and influencing culture for billions of users. Traditional single-model recommenders, which optimize static engagement metrics, are increasingly limited in addressing the dynamic requirements of modern platforms. In response, multi-agent architectures are redefining how video recommender systems serve, learn, and adapt to both users and datasets. These agent-based systems coordinate specialized agents responsible for video understanding, reasoning, memory, and feedback, to provide precise, explainable recommendations. In this survey, we trace the evolution of multi-agent video recommendation systems (MAVRS). We combine ideas from multi-agent recommender systems, foundation models, and conversational AI, culminating in the emerging field of large language model (LLM)-powered MAVRS. We present a taxonomy of collaborative patterns and analyze coordination mechanisms across diverse video domains, ranging from short-form clips to educational platforms. We discuss representative frameworks, including early multi-agent reinforcement learning (MARL) systems such as MMRF and recent LLM-driven architectures like MACRec and Agent4Rec, to illustrate these patterns. We also outline open challenges in scalability, multimodal understanding, incentive alignment, and identify research directions such as hybrid reinforcement learning-LLM systems, lifelong personalization and self-improving recommender systems.