Publication:4633809
From MaRDI portal
zbMath1425.91084MaRDI QIDQ4633809
Publication date: 6 May 2019
Full work available at URL: https://dl.acm.org/citation.cfm?id=1496775
91B06: Decision theory
68T05: Learning and adaptive systems in artificial intelligence
90C40: Markov and semi-Markov decision processes
91A60: Probabilistic games; gambling
Related Items
Weighted last-step min-max algorithm with improved sub-logarithmic regret, Extracting certainty from uncertainty: regret bounded by variation in costs, Regret bounded by gradual variation for online convex optimization