D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding
AI 摘要
D5P4是一种基于行列式点过程的并行离散扩散解码方法,旨在提升生成文本的多样性。
主要贡献
- 提出了离散扩散模型的广义beam-search框架
- 设计了基于行列式点过程(DPP)的D5P4多样性选择算法
- 实现了多GPU兼容和高效的计算
- 实验证明D5P4在多样性和质量上优于现有方法
方法论
D5P4利用DPP模型进行beam选择,通过贪心求解MAP推断,在模型概率和目标多样性之间进行权衡。
原文摘要
Discrete diffusion models are promising alternatives to autoregressive approaches for text generation, yet their decoding methods remain under-studied. Standard decoding methods for autoregressive models, such as beam search, do not directly apply to iterative denoising, and existing diffusion decoding techniques provide limited control over in-batch diversity. To bridge this gap, we introduce a generalized beam-search framework for discrete diffusion that generates candidates in parallel and supports modular beam-selection objectives. As a diversity-focused instantiation, we propose D5P4, which formulates the selection step as MAP inference over a Determinantal Point Process. Leveraging a scalable greedy solver, D5P4 maintains multi-GPU compatibility and enables an explicit trade-off between model probability and target diversity with near-zero compute overhead. Experiments on free-form generation and question answering demonstrate that D5P4 improves diversity over strong baselines while maintaining competitive generation quality.