Variational inference via radial transport
AI 摘要
radVI算法通过优化径向轮廓改进变分推断,提升高维分布近似的准确性。
主要贡献
- 提出了一种新的变分推断算法radVI
- 为radVI提供了理论收敛保证
- 利用Wasserstein空间和径向传输映射理论进行优化
方法论
通过优化径向轮廓解决变分推断中高斯分布无法捕捉真实分布径向特征的问题,作为现有VI方法的补充。
原文摘要
In variational inference (VI), the practitioner approximates a high-dimensional distribution $π$ with a simple surrogate one, often a (product) Gaussian distribution. However, in many cases of practical interest, Gaussian distributions might not capture the correct radial profile of $π$, resulting in poor coverage. In this work, we approach the VI problem from the perspective of optimizing over these radial profiles. Our algorithm radVI is a cheap, effective add-on to many existing VI schemes, such as Gaussian (mean-field) VI and Laplace approximation. We provide theoretical convergence guarantees for our algorithm, owing to recent developments in optimization over the Wasserstein space--the space of probability distributions endowed with the Wasserstein distance--and new regularity properties of radial transport maps in the style of Caffarelli (2000).