Pages that link to "Item:Q3520073"
From MaRDI portal
The following pages link to Pseudometrics for State Aggregation in Average Reward Markov Decision Processes (Q3520073):
Displayed 5 items.
- Extreme state aggregation beyond Markov decision processes (Q329613) (← links)
- Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753) (← links)
- Regret bounds for restless Markov bandits (Q465253) (← links)
- Online Regret Bounds for Markov Decision Processes with Deterministic Transitions (Q3529915) (← links)
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs (Q5227201) (← links)