Bandits with switching costs, \(T^{2/3}\) regret
DOI: 10.1145/2591796.2591868
zbMATH Open: 1315.68207
arXiv: 1310.2997
OpenAlex: W2109690147
MaRDI QID: Q5259581
FDO: Q5259581
Authors: Ofer Dekel, Jian Ding, Tomer Koren, Yuval Peres
Publication date: 26 June 2015
Published in: Proceedings of the forty-sixth annual ACM symposium on Theory of computing
Full work available at URL: https://arxiv.org/abs/1310.2997
- Learning and adaptive systems in artificial intelligence (68T05)
- Online algorithms; streaming algorithms (68W27)
- Randomized algorithms (68W20)
- Sums of independent random variables; random walks (60G50)
- Computational difficulty of problems (lower bounds, completeness, difficulty of approximation, etc.) (68Q17)
- Rationality and learning in game theory (91A26)
Cites Work
- Theory of Cryptography
- On the geometry of differential privacy
- Interactive privacy via the median mechanism
- The price of privately releasing contingency tables and the spectra of random matrices with correlated rows
- Lower bounds in differential privacy
- Iterative Constructions and Private Data Release
- Differential privacy and the fat-shattering dimension of linear queries
- Our Data, Ourselves: Privacy Via Distributed Noise Generation
- On the complexity of differentially private data release, efficient algorithms and hardness results
- Collusion-secure fingerprinting for digital data
- Advances in Cryptology – CRYPTO 2004
- New Efficient Attacks on Statistical Disclosure Control Mechanisms
- Bounds on the sample complexity for private learning and private data release
- Answering \(n^{2+o(1)}\) counting queries with differential privacy is hard
- Characterizing the sample complexity of private learners
- Faster algorithms for privately releasing marginals
- Faster private release of marginals on small databases
- Private Learning and Sanitization: Pure vs. Approximate Differential Privacy
- Efficient algorithms for privately releasing marginals via convex relaxations
Cited In (6)
- Multi-armed Bandits with Metric Switching Costs
- Average optimality in a Poissonian bandit with switching arms
- Constrained no-regret learning
- Online learning over a finite action set with limited switching
- Chasing Ghosts: Competing with Stateful Policies
- Sharp dichotomies for regret minimization in metric spaces