Multimodal Learning 相关度: 8/10

RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation

Yixin Chen, Ziyu Su, Hikmat Khan, Muhammad Khalid Khan Niazi
arXiv: 2603.04348v1 发布: 2026-03-04 更新: 2026-03-04

AI 摘要

RANGER模型通过专家混合和自适应检索重排序,提升病理报告生成的质量。

主要贡献

  • 提出了一种基于稀疏门控专家混合(MoE)的病理报告生成框架RANGER。
  • 引入自适应检索重排序模块,减少噪声并改善语义对齐。
  • 在PathText-BRCA数据集上取得了显著的性能提升。

方法论

使用稀疏门控MoE进行动态专家选择,并结合自适应检索重排序模块优化知识库检索结果。

原文摘要

Pathology report generation remains a relatively under-explored downstream task, primarily due to the gigapixel scale and complex morphological heterogeneity of Whole Slide Images (WSIs). Existing pathology report generation frameworks typically employ transformer architectures, relying on a homogeneous decoder architecture and static knowledge retrieval integration. Such architectures limit generative specialization and may introduce noisy external guidance during the report generation process. To address these limitations, we propose RANGER, a sparsely-gated Mixture-of-Experts (MoE) framework with adaptive retrieval re-ranking for pathology report generation. Specifically, we integrate a sparsely gated MoE into the decoder, along with noisy top-$k$ routing and load-balancing regularization, to enable dynamic expert specialization across various diagnostic patterns. Additionally, we introduce an adaptive retrieval re-ranking module that selectively refines retrieved memory from a knowledge base before integration, reducing noise and improving semantic alignment based on visual feature representations. We perform extensive experiments on the PathText-BRCA dataset and demonstrate consistent improvements over existing approaches across standard natural language generation metrics. Our full RANGER model achieves optimal performance on PathText dataset, reaching BLEU-1 to BLEU-4 scores of 0.4598, 0.3044, 0.2036, and 0.1435, respectively, with METEOR of 0.1883, and ROUGE-L of 0.3038, validating the effectiveness of dynamic expert routing and adaptive knowledge refinement for semantically grounded pathology report generation.

标签

病理报告生成 混合专家模型 检索重排序 医学图像处理

arXiv 分类

cs.CV cs.AI