Multimodal Learning 相关度: 8/10

ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography Analysis

Zhan Jin, Yu Luo, Yizhou Zhang, Ziyang Cui, Yuqing Wei, Xianchao Liu, Xueying Zeng, Qing Zhang
arXiv: 2603.19169v1 发布: 2026-03-19 更新: 2026-03-19

AI 摘要

ARIADNE框架通过偏好对齐感知和RL推理,提升冠状动脉造影分析的可靠性。

主要贡献

  • 提出结合DPO和Sa2VA的感知模块,利用Betti数约束进行拓扑对齐
  • 设计基于RL的推理模块,通过拒绝机制优化诊断可靠性
  • 在临床数据上验证了框架的有效性和泛化性

方法论

使用DPO微调视觉-语言模型,以拓扑约束作为偏好信号,再通过RL进行推理,优化局部化。

原文摘要

Conventional pixel-wise loss functions fail to enforce topological constraints in coronary vessel segmentation, producing fragmented vascular trees despite high pixel-level accuracy. We present ARIADNE, a two-stage framework coupling preference-aligned perception with RL-based diagnostic reasoning for topologically coherent stenosis detection. The perception module employs DPO to fine-tune the Sa2VA vision-language foundation model using Betti number constraints as preference signals, aligning the policy toward geometrically complete vessel structures rather than pixel-wise overlap metrics. The reasoning module formulates stenosis localization as a Markov Decision Process with an explicit rejection mechanism that autonomously defers ambiguous anatomical candidates such as bifurcations and vessel crossings, shifting from coverage maximization to reliability optimization. On 1,400 clinical angiograms, ARIADNE achieves state-of-the-art centerline Dice of 0.838, reduces false positives by 41% compared to geometric baselines. External validation on multi-center benchmarks ARCADE and XCAD confirms generalization across acquisition protocols. This represents the first application of DPO for topological alignment in medical imaging, demonstrating that preference-based learning over structural constraints mitigates topological violations while maintaining diagnostic sensitivity in interventional cardiology workflows.

标签

医学图像分割 强化学习 偏好学习 视觉-语言模型

arXiv 分类

cs.CV cs.AI