AI Agents 相关度: 8/10

Safe Urban Traffic Control via Uncertainty-Aware Conformal Prediction and World-Model Reinforcement Learning

Joydeep Chandra, Satyam Kumar Navneet, Aleksandr Algazinov, Yong Zhang
arXiv: 2602.04821v1 发布: 2026-02-04 更新: 2026-02-04

AI 摘要

STREAM-RL框架通过不确定性感知方法实现安全可靠的城市交通控制。

主要贡献

  • PU-GAT+:不确定性引导的自适应共形预测器
  • CRFN-BY:基于共形残差流网络的不确定性建模
  • LyCon-WRL+:基于李雅普诺夫稳定的安全世界模型强化学习

方法论

结合共形预测、残差流网络和强化学习,利用不确定性信息指导预测、异常检测和安全策略学习。

原文摘要

Urban traffic management demands systems that simultaneously predict future conditions, detect anomalies, and take safe corrective actions -- all while providing reliability guarantees. We present STREAM-RL, a unified framework that introduces three novel algorithmic contributions: (1) PU-GAT+, an Uncertainty-Guided Adaptive Conformal Forecaster that uses prediction uncertainty to dynamically reweight graph attention via confidence-monotonic attention, achieving distribution-free coverage guarantees; (2) CRFN-BY, a Conformal Residual Flow Network that models uncertainty-normalized residuals via normalizing flows with Benjamini-Yekutieli FDR control under arbitrary dependence; and (3) LyCon-WRL+, an Uncertainty-Guided Safe World-Model RL agent with Lyapunov stability certificates, certified Lipschitz bounds, and uncertainty-propagated imagination rollouts. To our knowledge, this is the first framework to propagate calibrated uncertainty from forecasting through anomaly detection to safe policy learning with end-to-end theoretical guarantees. Experiments on multiple real-world traffic trajectory data demonstrate that STREAM-RL achieves 91.4\% coverage efficiency, controls FDR at 4.1\% under verified dependence, and improves safety rate to 95.2\% compared to 69\% for standard PPO while achieving higher reward, with 23ms end-to-end inference latency.

标签

交通控制 强化学习 不确定性量化 共形预测

arXiv 分类

cs.LG cs.AI