Low-distortion subspace embeddings in input-sparsity time and applications to robust linear regression

From MaRDI portal
Publication:5495779

DOI10.1145/2488608.2488621zbMATH Open1293.68150arXiv1210.3135OpenAlexW2134342155MaRDI QIDQ5495779FDOQ5495779


Authors: Xiangrui Meng, Michael W. Mahoney Edit this on Wikidata


Publication date: 7 August 2014

Published in: Proceedings of the forty-eighth annual ACM symposium on Theory of Computing (Search for Journal in Brave)

Abstract: Low-distortion embeddings are critical building blocks for developing random sampling and random projection algorithms for linear algebra problems. We show that, given a matrix AinRnimesd with nggd and a pin[1,2), with a constant probability, we can construct a low-distortion embedding matrix PiinRO(poly(d))imesn that embeds Ap, the ellp subspace spanned by A's columns, into (RO(poly(d)),|cdot|p); the distortion of our embeddings is only O(poly(d)), and we can compute PiA in O(nz(A)) time, i.e., input-sparsity time. Our result generalizes the input-sparsity time ell2 subspace embedding by Clarkson and Woodruff [STOC'13]; and for completeness, we present a simpler and improved analysis of their construction for ell2. These input-sparsity time ellp embeddings are optimal, up to constants, in terms of their running time; and the improved running time propagates to applications such as (1pmepsilon)-distortion ellp subspace embedding and relative-error ellp regression. For ell2, we show that a (1+epsilon)-approximate solution to the ell2 regression problem specified by the matrix A and a vector binRn can be computed in O(nz(A)+d3log(d/epsilon)/epsilon2) time; and for ellp, via a subspace-preserving sampling procedure, we show that a (1pmepsilon)-distortion embedding of Ap into RO(poly(d)) can be computed in O(nz(A)cdotlogn) time, and we also show that a (1+epsilon)-approximate solution to the ellp regression problem minxinRd|Axb|p can be computed in O(nz(A)cdotlogn+poly(d)log(1/epsilon)/epsilon2) time. Moreover, we can improve the embedding dimension or equivalently the sample size to O(d3+p/2log(1/epsilon)/epsilon2) without increasing the complexity.


Full work available at URL: https://arxiv.org/abs/1210.3135




Recommendations





Cited In (42)

Uses Software





This page was built for publication: Low-distortion subspace embeddings in input-sparsity time and applications to robust linear regression

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5495779)