The following pages link to Online Markov Decision Processes (Q3169063):
Displayed 5 items.
- Online regret bounds for Markov decision processes with deterministic transitions (Q982638) (← links)
- Online spatio-temporal matching in stochastic and dynamic domains (Q1648078) (← links)
- Approachability in Stackelberg stochastic games with vector costs (Q1707454) (← links)
- Reinforcement Learning in Robust Markov Decision Processes (Q2833106) (← links)
- Chasing Ghosts: Competing with Stateful Policies (Q2968152) (← links)