Streaming algorithms via precision sampling
From MaRDI portal
Publication:5494976
DOI10.1109/FOCS.2011.82zbMATH Open1292.68186arXiv1011.1263MaRDI QIDQ5494976FDOQ5494976
Authors: Alexandr Andoni, Robert Krauthgamer, Krzysztof Onak
Publication date: 30 July 2014
Published in: 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science (Search for Journal in Brave)
Abstract: A technique introduced by Indyk and Woodruff [STOC 2005] has inspired several recent advances in data-stream algorithms. We show that a number of these results follow easily from the application of a single probabilistic method called Precision Sampling. Using this method, we obtain simple data-stream algorithms that maintain a randomized sketch of an input vector , which is useful for the following applications. 1) Estimating the -moment of , for . 2) Estimating the -norm of , for , with small update time. 3) Estimating cascaded norms for all . 4) sampling, where the goal is to produce an element with probability (approximately) . It extends to similarly defined -sampling, for . For all these applications the algorithm is essentially the same: scale the vector x entry-wise by a well-chosen random vector, and run a heavy-hitter estimation algorithm on the resulting vector. Our sketch is a linear function of x, thereby allowing general updates to the vector x. Precision Sampling itself addresses the problem of estimating a sum from weak estimates of each real . More precisely, the estimator first chooses a desired precision for each , and then it receives an estimate of every within additive . Its goal is to provide a good approximation to while keeping a tab on the "approximation cost" . Here we refine previous work [Andoni, Krauthgamer, and Onak, FOCS 2010] which shows that as long as , a good multiplicative approximation can be achieved using total precision of only .
Full work available at URL: https://arxiv.org/abs/1011.1263
Recommendations
- 1-pass relative-error \(L_p\)-sampling with applications
- On Estimating Frequency Moments of Data Streams
- Optimal approximations of the frequency moments of data streams
- An improved data stream summary: the count-min sketch and its applications
- Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques
Probability in computer science (algorithm analysis, random structures, phase transitions, etc.) (68Q87) Analysis of algorithms (68W40)
Cited In (23)
- Title not available (Why is that?)
- 1-pass relative-error \(L_p\)-sampling with applications
- Precision vs confidence tradeoffs for \(\ell_2\)-based frequency estimation in data streams
- Towards Optimal Moment Estimation in Streaming and Distributed Models
- Quantum Chebyshev's Inequality and Applications
- On approximating matrix norms in data streams
- Hierarchical sampling from sketches: Estimating functions over data streams
- Perfect \(L_p\) sampling in a data stream
- Secure sampling with sublinear communication
- A general method for estimating correlated aggregates over a data stream
- Efficient sampling of non-strict turnstile data streams
- Optimal Random Sampling from Distributed Streams Revisited
- Taylor polynomial estimator for estimating frequency moments
- Streaming Euclidean MST to a constant factor
- Continuous monitoring of \(\ell_p\) norms in data streams
- Revisiting frequency moment estimation in random order streams
- On sketching the \(q\) to \(p\) norms
- Deterministic heavy hitters with sublinear query time
- Data streams and applications in computer science
- Sublinear algorithms for MAXCUT and correlation clustering
- The Simultaneous Communication of Disjointness with Applications to Data Streams
- Streaming Algorithms Measured in Terms of the Computed Quantity
- Towards Optimal Moment Estimation in Streaming and Distributed Models
This page was built for publication: Streaming algorithms via precision sampling
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5494976)