AI Agents 相关度: 8/10

Curiosity is Knowledge: Self-Consistent Learning and No-Regret Optimization with Active Inference

Yingke Li, Anjali Parashar, Enlu Zhou, Chuchu Fan
arXiv: 2602.06029v1 发布: 2026-02-05 更新: 2026-02-05

AI 摘要

该论文提出了主动推理(AIF)框架下,通过“足够的好奇心”实现一致学习和无悔优化的理论保证。

主要贡献

  • 证明了“足够好奇心”同时确保自洽学习和无悔优化
  • 建立了AIF与贝叶斯实验设计和贝叶斯优化的联系
  • 为调整认知-实用权衡提供了实践设计指南

方法论

理论分析和证明,并结合现实世界的实验进行验证,将AIF框架与贝叶斯理论相结合。

原文摘要

Active inference (AIF) unifies exploration and exploitation by minimizing the Expected Free Energy (EFE), balancing epistemic value (information gain) and pragmatic value (task performance) through a curiosity coefficient. Yet it has been unclear when this balance yields both coherent learning and efficient decision-making: insufficient curiosity can drive myopic exploitation and prevent uncertainty resolution, while excessive curiosity can induce unnecessary exploration and regret. We establish the first theoretical guarantee for EFE-minimizing agents, showing that a single requirement--sufficient curiosity--simultaneously ensures self-consistent learning (Bayesian posterior consistency) and no-regret optimization (bounded cumulative regret). Our analysis characterizes how this mechanism depends on initial uncertainty, identifiability, and objective alignment, thereby connecting AIF to classical Bayesian experimental design and Bayesian optimization within one theoretical framework. We further translate these theories into practical design guidelines for tuning the epistemic-pragmatic trade-off in hybrid learning-optimization problems, validated through real-world experiments.

标签

主动推理 强化学习 贝叶斯优化 实验设计

arXiv 分类

cs.LG