Multimodal Learning 相关度: 9/10

MieDB-100k: A Comprehensive Dataset for Medical Image Editing

Yongfan Lai, Wen Qian, Bo Liu, Hongyan Li, Hao Luo, Fan Wang, Bohan Zhuang, Shenda Hong
arXiv: 2602.09587v1 发布: 2026-02-10 更新: 2026-02-10

AI 摘要

MieDB-100k是一个大规模、高质量的医学图像编辑数据集,促进医学图像编辑模型的发展。

主要贡献

  • 构建大规模、高质量、多样化的医学图像编辑数据集MieDB-100k
  • 提出包含感知、修改和转换三种编辑任务的数据集分类方法
  • 证明了在MieDB-100k上训练的模型优于现有模型

方法论

通过专家模型和规则的数据合成方法,结合人工审核构建数据集,确保临床保真度。

原文摘要

The scarcity of high-quality data remains a primary bottleneck in adapting multimodal generative models for medical image editing. Existing medical image editing datasets often suffer from limited diversity, neglect of medical image understanding and inability to balance quality with scalability. To address these gaps, we propose MieDB-100k, a large-scale, high-quality and diverse dataset for text-guided medical image editing. It categorizes editing tasks into perspectives of Perception, Modification and Transformation, considering both understanding and generation abilities. We construct MieDB-100k via a data curation pipeline leveraging both modality-specific expert models and rule-based data synthetic methods, followed by rigorous manual inspection to ensure clinical fidelity. Extensive experiments demonstrate that model trained with MieDB-100k consistently outperform both open-source and proprietary models while exhibiting strong generalization ability. We anticipate that this dataset will serve as a cornerstone for future advancements in specialized medical image editing.

标签

医学图像编辑 数据集 多模态学习 文本引导编辑

arXiv 分类

cs.CV cs.AI