Tighter low-rank approximation via sampling the leveraged element

DOI10.1137/1.9781611973730.62zbMATH Open1371.68320arXiv1410.3886OpenAlexW2951791000MaRDI QIDQ5362997FDOQ5362997

Authors: Srinadh Bhojanapalli, Prateek Jain, Sujay Sanghavi

Publication date: 5 October 2017

Published in: Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms (Search for Journal in Brave)

Abstract: In this work, we propose a new randomized algorithm for computing a low-rank approximation to a given matrix. Taking an approach different from existing literature, our method first involves a specific biased sampling, with an element being chosen based on the leverage scores of its row and column, and then involves weighted alternating minimization over the factored form of the intended low-rank matrix, to minimize error only on these samples. Our method can leverage input sparsity, yet produce approximations in {em spectral} (as opposed to the weaker Frobenius) norm; this combines the best aspects of otherwise disparate current results, but with a dependence on the condition number

k a p p a = s i g m a_{1} / s i g m a_{r}

. In particular we require

O (n n z (M) + f r a c n k a p p a^{2} r^{5} e p s i l o n^{2})

computations to generate a rank-

r

approximation to

M

in spectral norm. In contrast, the best existing method requires

O (n n z (M) + f r a c n r^{2} e p s i l o n^{4})

time to compute an approximation in Frobenius norm. Besides the tightness in spectral norm, we have a better dependence on the error

e p s i l o n

. Our method is naturally and highly parallelizable. Our new approach enables two extensions that are interesting on their own. The first is a new method to directly compute a low-rank approximation (in efficient factored form) to the product of two given matrices; it computes a small random set of entries of the product, and then executes weighted alternating minimization (as before) on these. The sampling strategy is different because now we cannot access leverage scores of the product matrix (but instead have to work with input matrices). The second extension is an improved algorithm with smaller communication complexity for the distributed PCA setting (where each server has small set of rows of the matrix, and want to compute low rank approximation with small amount of communication with other servers).

Full work available at URL: https://arxiv.org/abs/1410.3886

Recommendations

Mathematics Subject Classification ID

Randomized algorithms (68W20) Approximation algorithms (68W25) Distributed algorithms (68W15)

Cited In (9)

This page was built for publication: Tighter low-rank approximation via sampling the leveraged element

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5362997)