Pages that link to "Item:Q653803"

From MaRDI portal

← UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem (Q653803)

Jump to:navigation, search

The following pages link to UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem (Q653803):

Displaying 7 items.

Batched bandit problems (Q282463) (← links)
Modification of improved upper confidence bounds for regulating exploration in Monte-Carlo tree search (Q307787) (← links)
The multi-armed bandit problem with covariates (Q355096) (← links)
Trading utility and uncertainty: applying the value of information to resolve the exploration-exploitation dilemma in reinforcement learning (Q2094051) (← links)
Ballooning multi-armed bandits (Q2238588) (← links)
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems (Q5219722) (← links)
Transfer learning for contextual multi-armed bandits (Q6192325) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere"