The zig-zag process and super-efficient sampling for Bayesian analysis of big data

From MaRDI portal
Publication:144289

DOI10.1214/18-AOS1715zbMATH Open1417.65008arXiv1607.03188OpenAlexW3104370808WikidataQ128371771 ScholiaQ128371771MaRDI QIDQ144289FDOQ144289


Authors: Joris Bierkens, Paul Fearnhead, Gareth Roberts, Joris Bierkens, Paul Fearnhead, Gareth O. Roberts Edit this on Wikidata


Publication date: 11 July 2016

Published in: The Annals of Statistics (Search for Journal in Brave)

Abstract: Standard MCMC methods can scale poorly to big data settings due to the need to evaluate the likelihood at each iteration. There have been a number of approximate MCMC algorithms that use sub-sampling ideas to reduce this computational burden, but with the drawback that these algorithms no longer target the true posterior distribution. We introduce a new family of Monte Carlo methods based upon a multi-dimensional version of the Zig-Zag process of (Bierkens, Roberts, 2017), a continuous time piecewise deterministic Markov process. While traditional MCMC methods are reversible by construction (a property which is known to inhibit rapid convergence) the Zig-Zag process offers a flexible non-reversible alternative which we observe to often have favourable convergence properties. We show how the Zig-Zag process can be simulated without discretisation error, and give conditions for the process to be ergodic. Most importantly, we introduce a sub-sampling version of the Zig-Zag process that is an example of an {em exact approximate scheme}, i.e. the resulting approximate process still has the posterior as its stationary distribution. Furthermore, if we use a control-variate idea to reduce the variance of our unbiased estimator, then the Zig-Zag process can be super-efficient: after an initial pre-processing step, essentially independent samples from the posterior distribution are obtained at a computational cost which does not depend on the size of the data.


Full work available at URL: https://arxiv.org/abs/1607.03188




Recommendations





Cited In (86)





This page was built for publication: The zig-zag process and super-efficient sampling for Bayesian analysis of big data

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q144289)