Square Hellinger Subadditivity for Bayesian Networks and its Applications to Identity Testing

From MaRDI portal
Publication:6280652

arXiv1612.03164MaRDI QIDQ6280652FDOQ6280652


Authors: Constantinos Daskalakis Edit this on Wikidata


Publication date: 9 December 2016

Abstract: We show that the square Hellinger distance between two Bayesian networks on the same directed graph, G, is subadditive with respect to the neighborhoods of G. Namely, if P and Q are the probability distributions defined by two Bayesian networks on the same DAG, our inequality states that the square Hellinger distance, H2(P,Q), between P and Q is upper bounded by the sum, sumvH2(PvcupPiv,QvcupPiv), of the square Hellinger distances between the marginals of P and Q on every node v and its parents Piv in the DAG. Importantly, our bound does not involve the conditionals but the marginals of P and Q. We derive a similar inequality for more general Markov Random Fields. As an application of our inequality, we show that distinguishing whether two Bayesian networks P and Q on the same (but potentially unknown) DAG satisfy P=Q vs dmTV(P,Q)>epsilon can be performed from ildeO(|Sigma|3/4(d+1)cdotn/epsilon2) samples, where d is the maximum in-degree of the DAG and Sigma the domain of each variable of the Bayesian networks. If P and Q are defined on potentially different and potentially unknown trees, the sample complexity becomes ildeO(|Sigma|4.5n/epsilon2), whose dependence on n,epsilon is optimal up to logarithmic factors. Lastly, if P and Q are product distributions over 0,1n and Q is known, the sample complexity becomes O(sqrtn/epsilon2), which is optimal up to constant factors.













This page was built for publication: Square Hellinger Subadditivity for Bayesian Networks and its Applications to Identity Testing

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6280652)