Minimax estimation of kernel mean embeddings

Authors Ilya Tolstikhin, Krikamol Muandet, Bharath K. Sriperumbudur

Publication date 17 April 2018

Full work available at URL https://arxiv.org/abs/1602.04361, http://jmlr.csail.mit.edu/papers/v18/17-032.html

reproducing kernel Hilbert space Bochner integral Bochner's theorem minimax lower bounds translation invariant kernel kernel mean embeddings

Mathematics Subject Classification ID

Estimation in multivariate analysis (62H12) Minimax procedures in statistical decision theory (62C20) Hilbert spaces with reproducing kernels (= (proper) functional Hilbert spaces, including de Branges-Rovnyak and other structured spaces) (46E22)

Abstract: In this paper, we study the minimax estimation of the Bochner integral

mu_k(P):=int_{mathcal{X}} k(cdot,x),dP(x),

also called as the kernel mean embedding, based on random samples drawn i.i.d.~from

P

, where

k : m a t h c a l X i m e s m a t h c a l X i g h t a r r o w m a t h b b R

is a positive definite kernel. Various estimators (including the empirical estimator),

h a t {h e t a}_{n}

of

m u_{k} (P)

are studied in the literature wherein all of them satisfy

with

m a t h c a l H_{k}

being the reproducing kernel Hilbert space induced by

k

. The main contribution of the paper is in showing that the above mentioned rate of

n^{- 1 / 2}

is minimax in

| c d o t |_{m a t h c a l H_{k}}

and

| c d o t |_{L^{2} (m a t h b b R^{d})}

-norms over the class of discrete measures and the class of measures that has an infinitely differentiable density, with

k

being a continuous translation-invariant kernel on

m a t h b b R^{d}

. The interesting aspect of this result is that the minimax rate is independent of the smoothness of the kernel and the density of

P

(if it exists). This result has practical consequences in statistical applications as the mean embedding has been widely employed in non-parametric hypothesis testing, density estimation, causal inference and feature selection, through its relation to energy distance (and distance covariance).

Recommendations

Cites work

Cited in

(11)

This page was built for publication: Minimax estimation of kernel mean embeddings

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4636999)