Chasing Ghosts: Competing with Stateful Policies
DOI: 10.1137/14100227X; zbMATH Open: 1410.91168; arXiv: 1407.7635; MaRDI QID: Q2968152; FDO: Q2968152
Authors: Tomer Koren, Moshe Tennenholtz, Uriel Feige
Publication date: 10 March 2017
Published in: SIAM Journal on Computing
Full work available at URL: https://arxiv.org/abs/1407.7635
Recommendations
- State-policy dynamics in evolutionary games
- Foundations of Software Science and Computation Structures
- Extreme state aggregation beyond MDPs
- On the Complexity of Reasoning About Dynamic Policies
- A state space distribution policy based on abstract interpretation
- State observation accuracy and finite-memory policy performance
- Learning with policy prediction in continuous state-action multi-agent decision processes
Mathematics Subject Classification
- Learning and adaptive systems in artificial intelligence (68T05)
- Online algorithms; streaming algorithms (68W27)
- Decision theory (91B06)
- Analysis of algorithms and problem complexity (68Q25)
- Probabilistic games; gambling (91A60)
Cites Work
- A decision-theoretic generalization of on-line learning and an application to boosting
- Markov chains and mixing times. With a chapter on ``Coupling from the past'' by James G. Propp and David B. Wilson
- Prediction, Learning, and Games
- Algorithmic Game Theory
- The Nonstochastic Multiarmed Bandit Problem
- How to use expert advice
- Universal prediction of individual sequences
- Universal prediction
- Combining expert advice in reactive environments
- On sequential strategies for loss functions with memory
- Online Markov decision processes
- Bandits with switching costs, \(T^{2/3}\) regret
- Differential privacy under continual observation
- Online Markov Decision Processes Under Bandit Feedback
- Why are images smooth?
- High-confidence predictions under adversarial uncertainty
Cited In (3)