How Fast Is the Bandit?
Publication:3506304
DOI: 10.1080/07362990802007202
zbMath: 1416.62456
arXiv: math/0510351
OpenAlex: W2056330474
MaRDI QID: Q3506304
Damien Lamberton, Gilles Pagès
Publication date: 12 June 2008
Published in: Stochastic Analysis and Applications
Full work available at URL: https://arxiv.org/abs/math/0510351
MSC classification: Learning and adaptive systems in artificial intelligence (68T05); Adaptive control/observation systems (93C40); Stochastic approximation (62L20)
Related Items (4)
- Regret bounds for Narendra-Shapiro bandit algorithms
- On ergodic two-armed bandits
- Convergence in models with bounded expected relative hazard rates
- Nonlinear randomized urn models: a stochastic approximation viewpoint
Cites Work
- Nonconvergence to unstable points in urn models and stochastic approximations
- Weak convergence rates for stochastic approximation with application to multiple targets and simulated annealing
- Asymptotics in randomized urn models
- When can the two-armed bandit algorithm be trusted?
- On the linear model with two absorbing barriers
- Learning Automata - A Survey
- Decreasing step stochastic algorithms: a.s. behaviour of weighted empirical measures
- Use of Stochastic Automata for Parameter Self-Optimization with Multimodal Performance Criteria