Multimodal Learning 相关度: 7/10

Spectrally-Guided Diffusion Noise Schedules

Carlos Esteves, Ameesh Makadia

arXiv: 2603.19222v1 发布: 2026-03-19 更新: 2026-03-19

下载 PDF arXiv 页面

AI 摘要

提出了一种基于图像频谱特性的像素扩散模型噪声调度方法，提高了生成质量。

主要贡献

提出了基于图像频谱特性的噪声调度方法
推导了最小和最大噪声水平的理论界限
提出了条件采样噪声调度方法

方法论

基于图像频谱属性，推导噪声水平的理论界限，设计“紧凑”噪声调度，并条件采样。

原文摘要

Denoising diffusion models are widely used for high-quality image and video generation. Their performance depends on noise schedules, which define the distribution of noise levels applied during training and the sequence of noise levels traversed during sampling. Noise schedules are typically handcrafted and require manual tuning across different resolutions. In this work, we propose a principled way to design per-instance noise schedules for pixel diffusion, based on the image's spectral properties. By deriving theoretical bounds on the efficacy of minimum and maximum noise levels, we design ``tight'' noise schedules that eliminate redundant steps. During inference, we propose to conditionally sample such noise schedules. Experiments show that our noise schedules improve generative quality of single-stage pixel diffusion models, particularly in the low-step regime.

arXiv 分类

cs.CV cs.LG

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类