Multimodal Learning 相关度: 8/10

AWDiff: An a trous wavelet diffusion model for lung ultrasound image synthesis

Maryam Heidari, Nantheera Anantrasirichai, Steven Walker, Rahul Bhatnagar, Alin Achim

arXiv: 2603.03125v1 发布: 2026-03-03 更新: 2026-03-03

下载 PDF arXiv 页面

AI 摘要

提出AWDiff模型，利用小波变换和扩散模型进行肺部超声图像生成，提升图像质量。

主要贡献

提出了AWDiff模型
结合小波变换和扩散模型
利用BioMedCLIP进行语义条件约束

方法论

AWDiff模型利用a trous小波变换保留细粒度结构，结合扩散模型生成肺部超声图像，并使用BioMedCLIP进行语义约束。

原文摘要

Lung ultrasound (LUS) is a safe and portable imaging modality, but the scarcity of data limits the development of machine learning methods for image interpretation and disease monitoring. Existing generative augmentation methods, such as Generative Adversarial Networks (GANs) and diffusion models, often lose subtle diagnostic cues due to resolution reduction, particularly B-lines and pleural irregularities. We propose A trous Wavelet Diffusion (AWDiff), a diffusion based augmentation framework that integrates the a trous wavelet transform to preserve fine-scale structures while avoiding destructive downsampling. In addition, semantic conditioning with BioMedCLIP, a vision language foundation model trained on large scale biomedical corpora, enforces alignment with clinically meaningful labels. On a LUS dataset, AWDiff achieved lower distortion and higher perceptual quality compared to existing methods, demonstrating both structural fidelity and clinical diversity.

arXiv 分类

cs.CV

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类