Regret in the on-line decision problem

From MaRDI portal
Publication:1818283

DOI10.1006/game.1999.0740zbMath0984.91025OpenAlexW2135730283MaRDI QIDQ1818283

Rakesh V. Vohra, Dean P. Foster

Publication date: 2 May 2001

Published in: Games and Economic Behavior (Search for Journal in Brave)

Full work available at URL: http://www.dklevine.com/archive/refs4569.pdf




Related Items (50)

Achieving Unbounded Resolution inFinitePlayer Goore Games Using Stochastic Automata, and Its ApplicationsApproachability in population gamesReplicator dynamics: old and newA general internal regret-free strategyLearning correlated equilibria in games with compact sets of strategiesOnline calibrated forecasts: memory efficiency versus universality for learning in gamesA general criterion and an algorithmic framework for learning in multi-agent systemsRationality Authority for Provable Rational BehaviorApproachability with bounded memoryRobust mean field gamesDynamic benchmark targetingUnnamed ItemConstrained no-regret learningPrior‐free dynamic allocation under limited liabilityPredicting a binary sequence almost as well as the optimal biased coinNo-regret algorithms in on-line learning, games and convex optimizationLearning to compete, coordinate, and cooperate in repeated games using reinforcement learningUnnamed ItemOptimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary RewardsLearning, hypothesis testing, and Nash equilibrium.Approachability, regret and calibration: implications and equivalencesRegret minimization in repeated matrix games with variable stage durationLoad balancing without regret in the bulletin board modelRandomized prediction of individual sequencesDeterministic calibration and Nash equilibriumRepeated Games with Incomplete InformationExponential weight algorithm in continuous timeOpinion dynamics and learning in social networksA general class of adaptive strategiesStable games and their dynamicsComputer science and decision theoryA Robust Saturated Strategy for $n$-Player Prisoner's DilemmaInternal regret in on-line portfolio selectionInternal regret in on-line portfolio selectionMaximin effects in inhomogeneous large-scale dataLearning by trial and errorWhen autonomous agents model other agents: an appeal for altered judgment coupled with mouths, ears, and a little more tapeStrategic learning in games with symmetric information.A wide range no-regret theoremPrice probabilities: a class of Bayesian and non-Bayesian prediction rulesAdaptive game playing using multiplicative weightsConditional universal consistency.Minimizing regret: The general caseUnnamed ItemIf multi-agent learning is the answer, what is the question?Agendas for multi-agent learningThe possible and the impossible in multi-agent learningA hierarchy of prescriptive goals for multiagent learningOnline Discrete Optimization in Social Networks in the Presence of Knightian UncertaintyCalibration and Internal No-Regret with Random Signals


Uses Software


Cites Work


This page was built for publication: Regret in the on-line decision problem