ForwardFlow: Simulation only statistical inference using deep learning

AI 摘要

提出ForwardFlow，一种基于深度学习的仅模拟统计推断方法，利用神经网络学习参数估计。

主要贡献

提出基于summary网络的频繁主义模型
设计包含collapse层的分支网络结构
验证参数估计的有限样本精确性、鲁棒性和算法逼近性

方法论

训练神经网络，输入模拟数据集和参数，最小化学习到的summary和参数之间的均方误差，从而解决参数估计的逆问题。

原文摘要

Deep learning models are being used for the analysis of parametric statistical models based on simulation-only frameworks. Bayesian models using normalizing flows simulate data from a prior distribution and are composed of two deep neural networks: a summary network that learns a sufficient statistic for the parameter and a normalizing flow that conditional on the summary network can approximate the posterior distribution. Here, we explore frequentist models that are based on a single summary network. During training, input of the network is a simulated data set based on a parameter and the loss function minimizes the mean-square error between learned summary and parameter. The network thereby solves the inverse problem of parameter estimation. We propose a branched network structure that contains collapsing layers that reduce a data set to summary statistics that are further mapped through fully connected layers to approximate the parameter estimate. We motivate our choice of network structure by theoretical considerations. In simulations we demonstrate three desirable properties of parameter estimates: finite sample exactness, robustness to data contamination, and algorithm approximation. These properties are achieved offering the the network varying sample size, contaminated data, and data needing algorithmic reconstruction during the training phase. In our simulations an EM-algorithm for genetic data is automatically approximated by the network. Simulation only approaches seem to offer practical advantages in complex modeling tasks where the simpler data simulation part is left to the researcher and the more complex problem of solving the inverse problem is left to the neural network. Challenging future work includes offering pre-trained models that can be used in a wide variety of applications.

arXiv 分类

math.ST cs.LG cs.NE stat.CO

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类