LLM Reasoning 相关度: 6/10

D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding

Jonathan Lys, Vincent Gripon, Bastien Pasdeloup, Axel Marmoret, Lukas Mauch, Fabien Cardinaux, Ghouthi Boukli Hacene
arXiv: 2603.19146v1 发布: 2026-03-19 更新: 2026-03-19

AI 摘要

D5P4是一种基于行列式点过程的并行离散扩散解码方法,旨在提升生成文本的多样性。

主要贡献

  • 提出了离散扩散模型的广义beam-search框架
  • 设计了基于行列式点过程(DPP)的D5P4多样性选择算法
  • 实现了多GPU兼容和高效的计算
  • 实验证明D5P4在多样性和质量上优于现有方法

方法论

D5P4利用DPP模型进行beam选择,通过贪心求解MAP推断,在模型概率和目标多样性之间进行权衡。

原文摘要

Discrete diffusion models are promising alternatives to autoregressive approaches for text generation, yet their decoding methods remain under-studied. Standard decoding methods for autoregressive models, such as beam search, do not directly apply to iterative denoising, and existing diffusion decoding techniques provide limited control over in-batch diversity. To bridge this gap, we introduce a generalized beam-search framework for discrete diffusion that generates candidates in parallel and supports modular beam-selection objectives. As a diversity-focused instantiation, we propose D5P4, which formulates the selection step as MAP inference over a Determinantal Point Process. Leveraging a scalable greedy solver, D5P4 maintains multi-GPU compatibility and enables an explicit trade-off between model probability and target diversity with near-zero compute overhead. Experiments on free-form generation and question answering demonstrate that D5P4 improves diversity over strong baselines while maintaining competitive generation quality.

标签

离散扩散模型 文本生成 解码方法 行列式点过程 多样性

arXiv 分类

cs.AI cs.LG