Nonparametric bandit methods

From MaRDI portal

Publication:806690

Jump to:navigation, search

DOI10.1007/BF02055587zbMath0729.90089MaRDI QIDQ806690

Sid Yakowitz, Wing Lowe

Publication date: 1991

Published in: Annals of Operations Research (Search for Journal in Brave)

zbMATH Keywords

infinite-horizon bandit problem nonparametric setting

Mathematics Subject Classification ID

Markov chains (discrete-time Markov processes on discrete state spaces) (60J10) Markov and semi-Markov decision processes (90C40)

Related Items (10)

A non-parametric solution to the multi-armed bandit problem with covariates ⋮ The time until the final zero crossing of random sums with application to nonparametric bandit theory ⋮ Bandit and covariate processes, with finite or non-denumerable set of arms ⋮ MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS ⋮ The multi-armed bandit problem: an efficient nonparametric solution ⋮ An asymptotically optimal policy for finite support models in the multiarmed bandit problem ⋮ A decision model and methodology for the AIDS epidemic ⋮ Machine learning for optimal blackjack counting strategies ⋮ Policies without Memory for the Infinite-Armed Bernoulli Bandit under the Average-Reward Criterion ⋮ Sequential design with applications to the trim-loss problem

Cites Work

This page was built for publication: Nonparametric bandit methods

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:806690&oldid=12741617"