Multimodal Learning 相关度: 9/10

Learning from Limited and Incomplete Data: A Multimodal Framework for Predicting Pathological Response in NSCLC

Alice Natalina Caragliano, Giulia Farina, Fatih Aksu, Camillo Maria Caruso, Claudia Tacconi, Carlo Greco, Lorenzo Nibid, Edy Ippolito, Michele Fiore, Giuseppe Perrone, Sara Ramella, Paolo Soda, Valerio Guarrasi
arXiv: 2603.15100v1 发布: 2026-03-16 更新: 2026-03-16

AI 摘要

该论文提出一种多模态深度学习框架,利用CT影像和临床数据预测NSCLC新辅助治疗后的主要病理反应。

主要贡献

  • 提出基于基础模型CT特征提取方法
  • 设计缺失感知架构处理不完整的临床数据
  • 采用加权融合机制整合影像和临床模态

方法论

利用深度学习,结合基础模型提取CT特征,设计缺失感知架构处理临床数据,并通过加权融合进行多模态信息融合。

原文摘要

Major pathological response (pR) following neoadjuvant therapy is a clinically meaningful endpoint in non-small cell lung cancer, strongly associated with improved survival. However, accurate preoperative prediction of pR remains challenging, particularly in real-world clinical settings characterized by limited data availability and incomplete clinical profiles. In this study, we propose a multimodal deep learning framework designed to address these constraints by integrating foundation model-based CT feature extraction with a missing-aware architecture for clinical variables. This approach enables robust learning from small cohorts while explicitly modeling missing clinical information, without relying on conventional imputation strategies. A weighted fusion mechanism is employed to leverage the complementary contributions of imaging and clinical modalities, yielding a multimodal model that consistently outperforms both unimodal imaging and clinical baselines. These findings underscore the added value of integrating heterogeneous data sources and highlight the potential of multimodal, missing-aware systems to support pR prediction under realistic clinical conditions.

标签

多模态学习 深度学习 医学影像 肺癌 病理反应预测

arXiv 分类

cs.CV