Agent Tuning & Optimization 相关度: 7/10

Conservative Continuous-Time Treatment Optimization

Nora Schneider, Georg Manten, Niki Kilbertus
arXiv: 2603.16789v1 发布: 2026-03-17 更新: 2026-03-17

AI 摘要

提出了一种保守的连续时间治疗优化框架,通过MMD正则化限制外推。

主要贡献

  • 提出了保守的连续时间随机控制框架
  • 使用了基于签名的MMD正则化方法限制外推
  • 最小化真实成本的可计算上界

方法论

将患者动态建模为随机微分方程,利用MMD正则化限制治疗计划的轨迹分布偏离,优化目标为真实成本上界。

原文摘要

We develop a conservative continuous-time stochastic control framework for treatment optimization from irregularly sampled patient trajectories. The unknown patient dynamics are modeled as a controlled stochastic differential equation with treatment as a continuous-time control. Naive model-based optimization can exploit model errors and propose out-of-support controls, so optimizing the estimated dynamics may not optimize the true dynamics. To limit extrapolation, we add a consistent signature-based MMD regularizer on path space that penalizes treatment plans whose induced trajectory distribution deviates from observed trajectories. The resulting objective minimizes a computable upper bound on the true cost. Experiments on benchmark datasets show improved robustness and performance compared to non-conservative baselines.

标签

治疗优化 随机控制 MMD正则化 连续时间

arXiv 分类

cs.LG q-bio.QM