Conservative Continuous-Time Treatment Optimization
AI 摘要
提出了一种保守的连续时间治疗优化框架,通过MMD正则化限制外推。
主要贡献
- 提出了保守的连续时间随机控制框架
- 使用了基于签名的MMD正则化方法限制外推
- 最小化真实成本的可计算上界
方法论
将患者动态建模为随机微分方程,利用MMD正则化限制治疗计划的轨迹分布偏离,优化目标为真实成本上界。
原文摘要
We develop a conservative continuous-time stochastic control framework for treatment optimization from irregularly sampled patient trajectories. The unknown patient dynamics are modeled as a controlled stochastic differential equation with treatment as a continuous-time control. Naive model-based optimization can exploit model errors and propose out-of-support controls, so optimizing the estimated dynamics may not optimize the true dynamics. To limit extrapolation, we add a consistent signature-based MMD regularizer on path space that penalizes treatment plans whose induced trajectory distribution deviates from observed trajectories. The resulting objective minimizes a computable upper bound on the true cost. Experiments on benchmark datasets show improved robustness and performance compared to non-conservative baselines.