Pages that link to "Item:Q959899"
From MaRDI portal
The following pages link to An analysis of model-based interval estimation for Markov decision processes (Q959899):
Displayed 7 items.
- Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753) (← links)
- Near-optimal PAC bounds for discounted MDPs (Q465258) (← links)
- Bayesian optimistic Kullback-Leibler exploration (Q2425228) (← links)
- (Q4998915) (← links)
- Deep Reinforcement Learning: A State-of-the-Art Walkthrough (Q5145831) (← links)
- (Q5149240) (← links)
- Identity concealment games: how I learned to stop revealing and love the coincidences (Q6119741) (← links)