Pages that link to "Item:Q5113912"
From MaRDI portal
The following pages link to Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards (Q5113912):
Displayed 7 items.
- Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks (Q2021298) (← links)
- Fully probabilistic design of strategies with estimator (Q2139380) (← links)
- Lipschitzness is all you need to tame off-policy generative adversarial imitation learning (Q2163202) (← links)
- (Q4998863) (← links)
- Setting Reserve Prices in Second-Price Auctions with Unobserved Bids (Q5060778) (← links)
- Robust sequential design for piecewise-stationary multi-armed bandit problem in the presence of outliers (Q5880072) (← links)
- Model-based preference quantification (Q6136128) (← links)