Spectrally-Guided Diffusion Noise Schedules
AI 摘要
提出了一种基于图像频谱特性的像素扩散模型噪声调度方法,提高了生成质量。
主要贡献
- 提出了基于图像频谱特性的噪声调度方法
- 推导了最小和最大噪声水平的理论界限
- 提出了条件采样噪声调度方法
方法论
基于图像频谱属性,推导噪声水平的理论界限,设计“紧凑”噪声调度,并条件采样。
原文摘要
Denoising diffusion models are widely used for high-quality image and video generation. Their performance depends on noise schedules, which define the distribution of noise levels applied during training and the sequence of noise levels traversed during sampling. Noise schedules are typically handcrafted and require manual tuning across different resolutions. In this work, we propose a principled way to design per-instance noise schedules for pixel diffusion, based on the image's spectral properties. By deriving theoretical bounds on the efficacy of minimum and maximum noise levels, we design ``tight'' noise schedules that eliminate redundant steps. During inference, we propose to conditionally sample such noise schedules. Experiments show that our noise schedules improve generative quality of single-stage pixel diffusion models, particularly in the low-step regime.