Learning Representation and Control in Markov Decision Processes: New Frontiers
From MaRDI portal
Publication:3580907
Recommendations
- Learning Control of Dynamical Systems Based on Markov Decision Processes: Research Frontiers and Outlooks
- Learning control of finite Markov chains with an explicit trade-off between estimation and control
- Reinforcement learning in robust Markov decision processes
- Reinforcement Learning for Sequential Decision and Optimal Control
- From reinforcement learning to optimal control: a unified framework for sequential decisions
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- Reinforcement learning of non-Markov decision processes
Cited in
(8)- \(L^\ast\)-based learning of Markov decision processes (extended version)
- Markov reward models and Markov decision processes in discrete and continuous time: performance evaluation and optimization
- scientific article; zbMATH DE number 5957492 (Why is no real title available?)
- Low-Rank Representation of Reinforcement Learning Policies
- A new decision making model based on rank centrality for GDM with fuzzy preference relations
- Framing reinforcement learning from human reward: reward positivity, temporal discounting, episodicity, and performance
- Model-based Reinforcement Learning: A Survey
- Dimension reduction and its application to model-based exploration in continuous spaces
This page was built for publication: Learning Representation and Control in Markov Decision Processes: New Frontiers
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3580907)