Pages that link to "Item:Q5113912"
From MaRDI portal
The following pages link to Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards (Q5113912):
Displaying 5 items.
- Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks (Q2021298) (← links)
- Fully probabilistic design of strategies with estimator (Q2139380) (← links)
- Lipschitzness is all you need to tame off-policy generative adversarial imitation learning (Q2163202) (← links)
- Setting Reserve Prices in Second-Price Auctions with Unobserved Bids (Q5060778) (← links)
- Model-based preference quantification (Q6136128) (← links)