Optimal fast Johnson-Lindenstrauss embeddings for large data sets


DOI: 10.1007/s43670-021-00003-5
zbMATH Open: 1479.94054
arXiv: 1712.01774
OpenAlex: W3138538447
MaRDI QID: Q2059797
FDO: Q2059797

Felix Krahmer, Stefan Bamberger

Publication date: 14 December 2021

Published in: Sampling Theory, Signal Processing, and Data Analysis

Abstract: Johnson-Lindenstrauss embeddings are widely used to reduce the dimension, and thus the processing time, of data. To reduce the total complexity, fast algorithms for applying these embeddings are also necessary. To date, such fast algorithms are available either only for a non-optimal embedding dimension or only up to a certain threshold on the number of data points. We address a variant of this problem where one aims to simultaneously embed larger subsets of the data set. Our method follows an approach by Nelson: a subsampled Hadamard transform first maps the points into a space of lower, but not optimal, dimension; subsequently, a random matrix with independent entries projects to an optimal embedding dimension. For subsets whose size scales at least polynomially in the ambient dimension, and under mild assumptions on the size of the data set that are considerably less restrictive than in previous works, the complexity of this method comes close to the number of operations needed just to read the data. We also prove a lower bound showing that subsampled Hadamard matrices alone cannot reach an optimal embedding dimension; hence, the second embedding cannot be omitted.
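To make the two-stage construction described in the abstract concrete, here is a minimal Python sketch. It is not the authors' implementation: the function and parameter names (fast_jl_embed, m_intermediate, k_final) are illustrative, the ambient dimension is assumed to be a power of two, and the Hadamard transform is applied via an explicit matrix for clarity rather than via the O(d log d) fast Walsh-Hadamard transform that a real implementation would use.

```python
import numpy as np
from scipy.linalg import hadamard

def fast_jl_embed(X, m_intermediate, k_final, seed=None):
    """Illustrative two-stage fast JL sketch:
    (1) subsampled randomized Hadamard transform to an intermediate
        (lower, but not optimal) dimension,
    (2) dense Gaussian projection to the final embedding dimension.
    X has shape (n_points, d); d must be a power of two."""
    rng = np.random.default_rng(seed)
    n, d = X.shape

    # Stage 1: subsampled randomized Hadamard transform.
    # Random sign flips (diagonal Rademacher matrix).
    signs = rng.choice([-1.0, 1.0], size=d)
    # Orthonormal Hadamard matrix; explicit here only for readability.
    H = hadamard(d).astype(np.float64) / np.sqrt(d)
    # Uniformly subsample m_intermediate coordinates and rescale so that
    # squared norms are preserved in expectation.
    rows = rng.choice(d, size=m_intermediate, replace=False)
    Y = np.sqrt(d / m_intermediate) * ((X * signs) @ H)[:, rows]

    # Stage 2: dense random matrix with independent Gaussian entries,
    # projecting to the optimal embedding dimension.
    G = rng.standard_normal((m_intermediate, k_final)) / np.sqrt(k_final)
    return Y @ G

# Usage example (sizes chosen arbitrarily): embed 10,000 points from
# dimension 1024 down to 64 via an intermediate dimension of 256.
X = np.random.default_rng(0).standard_normal((10_000, 1024))
Z = fast_jl_embed(X, m_intermediate=256, k_final=64, seed=1)
```

The paper's lower bound is precisely the statement that stage 1 alone (the subsampled Hadamard transform) cannot reach an optimal embedding dimension, which is why the Gaussian stage 2 cannot be dropped.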


Full work available at URL: https://arxiv.org/abs/1712.01774










