LLM Reasoning 相关度: 6/10

Simulation-Based Inference via Regression Projection and Batched Discrepancies

Arya Farahi, Jonah Rose, Paul Torrey
arXiv: 2602.03613v1 发布: 2026-02-03 更新: 2026-02-03

AI 摘要

提出一种基于回归投影和批量差异的模拟推断方法,加速参数推断并分析其局限性。

主要贡献

  • 提出基于回归投影的轻量级模拟推断方法
  • 证明该方法的一致性和稳定性
  • 分析了该方法在点识别和集合识别方面的渐近性质

方法论

通过回归拟合代理模型,模拟小批量数据,并基于残差计算权重,构建自归一化伪后验。

原文摘要

We analyze a lightweight simulation-based inference method that infers simulator parameters using only a regression-based projection of the observed data. After fitting a surrogate linear regression once, the procedure simulates small batches at the proposed parameter values and assigns kernel weights based on the resulting batch-residual discrepancy, producing a self-normalized pseudo-posterior that is simple, parallelizable, and requires access only to the fitted regression coefficients rather than raw observations. We formalize the construction as an importance-sampling approximation to a population target that averages over simulator randomness, prove consistency as the number of parameter draws grows, and establish stability in estimating the surrogate regression from finite samples. We then characterize the asymptotic concentration as the batch size increases and the bandwidth shrinks, showing that the pseudo-posterior concentrates on an identified set determined by the chosen projection, thereby clarifying when the method yields point versus set identification. Experiments on a tractable nonlinear model and on a cosmological calibration task using the DREAMS simulation suite illustrate the computational advantages of regression-based projections and the identifiability limitations arising from low-information summaries.

标签

simulation-based inference regression importance sampling parameter inference

arXiv 分类

stat.ME cs.LG stat.ML