Regret in the on-line decision problem

From MaRDI portal

Publication:1818283

Jump to:navigation, search

DOI10.1006/game.1999.0740zbMath0984.91025OpenAlexW2135730283MaRDI QIDQ1818283

Rakesh V. Vohra, Dean P. Foster

Publication date: 2 May 2001

Published in: Games and Economic Behavior (Search for Journal in Brave)

Full work available at URL: http://www.dklevine.com/archive/refs4569.pdf

zbMATH Keywords

decision making regret actual state of the world uncertaint

Mathematics Subject Classification ID

Decision theory (91B06)

Related Items (50)

Achieving Unbounded Resolution inFinitePlayer Goore Games Using Stochastic Automata, and Its Applications ⋮ Approachability in population games ⋮ Replicator dynamics: old and new ⋮ A general internal regret-free strategy ⋮ Learning correlated equilibria in games with compact sets of strategies ⋮ Online calibrated forecasts: memory efficiency versus universality for learning in games ⋮ A general criterion and an algorithmic framework for learning in multi-agent systems ⋮ Rationality Authority for Provable Rational Behavior ⋮ Approachability with bounded memory ⋮ Robust mean field games ⋮ Dynamic benchmark targeting ⋮ Unnamed Item ⋮ Constrained no-regret learning ⋮ Prior‐free dynamic allocation under limited liability ⋮ Predicting a binary sequence almost as well as the optimal biased coin ⋮ No-regret algorithms in on-line learning, games and convex optimization ⋮ Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning ⋮ Unnamed Item ⋮ Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards ⋮ Learning, hypothesis testing, and Nash equilibrium. ⋮ Approachability, regret and calibration: implications and equivalences ⋮ Regret minimization in repeated matrix games with variable stage duration ⋮ Load balancing without regret in the bulletin board model ⋮ Randomized prediction of individual sequences ⋮ Deterministic calibration and Nash equilibrium ⋮ Repeated Games with Incomplete Information ⋮ Exponential weight algorithm in continuous time ⋮ Opinion dynamics and learning in social networks ⋮ A general class of adaptive strategies ⋮ Stable games and their dynamics ⋮ Computer science and decision theory ⋮ A Robust Saturated Strategy for $n$-Player Prisoner's Dilemma ⋮ Internal regret in on-line portfolio selection ⋮ Internal regret in on-line portfolio selection ⋮ Maximin effects in inhomogeneous large-scale data ⋮ Learning by trial and error ⋮ When autonomous agents model other agents: an appeal for altered judgment coupled with mouths, ears, and a little more tape ⋮ Strategic learning in games with symmetric information. ⋮ A wide range no-regret theorem ⋮ Price probabilities: a class of Bayesian and non-Bayesian prediction rules ⋮ Adaptive game playing using multiplicative weights ⋮ Conditional universal consistency. ⋮ Minimizing regret: The general case ⋮ Unnamed Item ⋮ If multi-agent learning is the answer, what is the question? ⋮ Agendas for multi-agent learning ⋮ The possible and the impossible in multi-agent learning ⋮ A hierarchy of prescriptive goals for multiagent learning ⋮ Online Discrete Optimization in Social Networks in the Presence of Knightian Uncertainty ⋮ Calibration and Internal No-Regret with Random Signals

Uses Software

AdaBoost.MH

Cites Work

This page was built for publication: Regret in the on-line decision problem

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1818283&oldid=14179112"