Probabilistic Polynomials and Hamming Nearest Neighbors

DOI: 10.1109/FOCS.2015.18, arXiv: 1507.05106, MaRDI QID: Q6263712, FDO: Q6263712

Publication date: 17 July 2015

Abstract: We show how to compute any symmetric Boolean function on $n$ variables over any field (as well as the integers) with a probabilistic polynomial of degree $O(\sqrt{n \log(1/\epsilon)})$ and error at most $\epsilon$. The degree dependence on $n$ and $\epsilon$ is optimal, matching a lower bound of Razborov (1987) and Smolensky (1987) for the MAJORITY function. The proof is constructive: a low-degree polynomial can be efficiently sampled from the distribution. This polynomial construction is combined with other algebraic ideas to give the first subquadratic time algorithm for computing a (worst-case) batch of Hamming distances in superlogarithmic dimensions, exactly. To illustrate, let $c(n) : \mathbb{N} \rightarrow \mathbb{N}$. Suppose we are given a database $D$ of $n$ vectors in $\{0,1\}^{c(n) \log n}$ and a collection of $n$ query vectors $Q$ in the same dimension. For all $u \in Q$, we wish to compute a $v \in D$ with minimum Hamming distance from $u$. We solve this problem in $n^{2 - 1/O(c(n) \log^{2} c(n))}$ randomized time. Hence, the problem is in "truly subquadratic" time for $O(\log n)$ dimensions, and in subquadratic time for $d = o((\log^{2} n)/(\log \log n)^{2})$. We apply the algorithm to computing pairs with maximum inner product, closest pair in $\ell_{1}$ for vectors with bounded integer entries, and pairs with maximum Jaccard coefficients.
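The batch problem stated in the abstract can be made concrete with a naive baseline. The Python sketch below is not the algorithm from the paper: it is the straightforward quadratic method, comparing every query against every database vector (on the order of $n^2 d$ coordinate comparisons for $n$ queries, $n$ database vectors, and dimension $d$), which the stated subquadratic bound improves on. The function names `hamming` and `batch_nearest_neighbors` and the tie-breaking rule are illustrative assumptions, not taken from the source.

```python
from typing import List, Sequence


def hamming(u: Sequence[int], v: Sequence[int]) -> int:
    """Count the coordinates where two equal-length 0/1 vectors differ."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    return sum(a != b for a, b in zip(u, v))


def batch_nearest_neighbors(
    database: Sequence[Sequence[int]],
    queries: Sequence[Sequence[int]],
) -> List[int]:
    """For each query u, return the index of a database vector v that
    minimizes hamming(u, v).

    Brute force: every query is compared with every database vector.
    Ties go to the lowest index (an arbitrary choice for this sketch).
    """
    if not database:
        raise ValueError("database must be non-empty")
    answers = []
    for u in queries:
        # min() returns the first minimal element, so ties pick the lowest index.
        best = min(range(len(database)), key=lambda i: hamming(u, database[i]))
        answers.append(best)
    return answers


# Small usage example with made-up vectors in dimension 4.
example_db = [[0, 0, 0, 0], [1, 1, 0, 0], [1, 1, 1, 1]]
example_queries = [[1, 0, 0, 0], [1, 1, 1, 0]]
print(batch_nearest_neighbors(example_db, example_queries))  # prints [0, 1]
```

In the example, the first query is at distance 1 from both database vectors 0 and 1, so the tie-breaking rule selects index 0; the second query is at distance 1 from vectors 1 and 2, so index 1 is returned.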

