Nonparametric Bayesian aggregation for massive data

Authors Zuofeng Shang, Botao Hao, Guang Cheng

Publication date 7 February 2020

Full work available at URL https://arxiv.org/abs/1508.04175, http://jmlr.csail.mit.edu/papers/v20/17-641.html

nonparametric Bayesian inference Gaussian process prior linear functional divide-and-conquer credible region

Nonparametric estimation (62G05) Statistical aspects of big data and data science (62R07) Gaussian processes (60G15) Nonparametric tolerance and confidence regions (62G15)

Abstract: We develop a set of scalable Bayesian inference procedures for a general class of nonparametric regression models. Specifically, nonparametric Bayesian inferences are separately performed on each subset randomly split from a massive dataset, and then the obtained local results are aggregated into global counterparts. This aggregation step is explicit without involving any additional computation cost. By a careful partition, we show that our aggregated inference results obtain an oracle rule in the sense that they are equivalent to those obtained directly from the entire data (which are computationally prohibitive). For example, an aggregated credible ball achieves desirable credibility level and also frequentist coverage while possessing the same radius as the oracle ball.

Recommendations

Cites work

Cited in

(15)

This page was built for publication: Nonparametric Bayesian aggregation for massive data

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5214232)