AI Agents 相关度: 8/10

Continual learning and refinement of causal models through dynamic predicate invention

Enrique Crespo-Fernandez, Oliver Ray, Telmo de Menezes e Silva Filho, Peter Flach
arXiv: 2602.17217v1 发布: 2026-02-19 更新: 2026-02-19

AI 摘要

提出一种通过动态谓词发明,在线学习和优化因果模型的框架,提升智能体在复杂环境下的性能。

主要贡献

  • 提出基于元解释学习和谓词发明的在线因果世界建模框架
  • 实现高效的样本利用率,优于PPO
  • 解决了复杂关系动态环境下的组合爆炸问题

方法论

结合连续模型学习与修复,利用元解释学习和谓词发明构建符号因果世界模型,实现概念分层。

原文摘要

Efficiently navigating complex environments requires agents to internalize the underlying logic of their world, yet standard world modelling methods often struggle with sample inefficiency, lack of transparency, and poor scalability. We propose a framework for constructing symbolic causal world models entirely online by integrating continuous model learning and repair into the agent's decision loop, by leveraging the power of Meta-Interpretive Learning and predicate invention to find semantically meaningful and reusable abstractions, allowing an agent to construct a hierarchy of disentangled, high-quality concepts from its observations. We demonstrate that our lifted inference approach scales to domains with complex relational dynamics, where propositional methods suffer from combinatorial explosion, while achieving sample-efficiency orders of magnitude higher than the established PPO neural-network-based baseline.

标签

因果模型 持续学习 元学习 谓词发明 强化学习

arXiv 分类

cs.AI