scientific article
From MaRDI portal
Publication:3046711
zbMath1050.68059MaRDI QIDQ3046711
Shie Mannor, Yishay Mansour, Eyal Even-Dar
Publication date: 12 August 2004
Full work available at URL: http://link.springer.de/link/service/series/0558/bibs/2375/23750255.htm
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (21)
Best Arm Identification for Contaminated Bandits ⋮ Approximation algorithms for stochastic combinatorial optimization problems ⋮ Sequential estimation of quantiles with applications to A/B testing and best-arm identification ⋮ Solving Large-Scale Fixed-Budget Ranking and Selection Problems ⋮ Always Valid Inference: Continuous Monitoring of A/B Tests ⋮ Unnamed Item ⋮ An asymptotically optimal policy for finite support models in the multiarmed bandit problem ⋮ Pure exploration in finitely-armed and continuous-armed bandits ⋮ Amplification and Derandomization without Slowdown ⋮ Tractable Sampling Strategies for Ordinal Optimization ⋮ Simple Bayesian Algorithms for Best-Arm Identification ⋮ Efficient PAC learning for episodic tasks with acyclic state spaces ⋮ An analysis of model-based interval estimation for Markov decision processes ⋮ Adaptive Incentive-Compatible Sponsored Search Auction ⋮ Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm ⋮ Polynomial-Time Algorithms for Multiple-Arm Identification with Full-Bandit Feedback ⋮ Bayesian Incentive-Compatible Bandit Exploration ⋮ Pure Exploration in Multi-armed Bandits Problems ⋮ A PAC algorithm in relative precision for bandit problem with costly sampling ⋮ Trading utility and uncertainty: applying the value of information to resolve the exploration-exploitation dilemma in reinforcement learning ⋮ Knockout-Tournament Procedures for Large-Scale Ranking and Selection in Parallel Computing Environments
Uses Software
This page was built for publication: