AI Agents 相关度: 7/10

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Klemens Iten, Bruce Lee, Chenhao Li, Lenart Treven, Andreas Krause, Bhavya Sukhija
arXiv: 2604.02260v1 发布: 2026-04-02 更新: 2026-04-02

AI 摘要

研究时变动态系统下的强化学习控制问题,提出一种基于模型的自适应数据缓存算法。

主要贡献

  • 分析了时变动态下的强化学习问题
  • 提出了基于高斯过程模型的变分预算假设
  • 设计了一种自适应数据缓存的乐观模型强化学习算法

方法论

使用高斯过程建立动态模型,并结合变分预算假设和乐观策略,进行模型强化学习,同时引入自适应数据缓存机制。

原文摘要

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under time-varying dynamics. We consider a continual model-based reinforcement learning setting in which an agent repeatedly learns and controls a dynamical system whose transition dynamics evolve across episodes. We analyze the problem using Gaussian process dynamics models under frequentist variation-budget assumptions. Our analysis shows that persistent non-stationarity requires explicitly limiting the influence of outdated data to maintain calibrated uncertainty and meaningful dynamic regret guarantees. Motivated by these insights, we propose a practical optimistic model-based reinforcement learning algorithm with adaptive data buffer mechanisms and demonstrate improved performance on continuous control benchmarks with non-stationary dynamics.

标签

强化学习 模型强化学习 时变动态 高斯过程 自适应控制

arXiv 分类

cs.LG cs.RO