Learning Representation and Control in Markov Decision Processes: New Frontiers
DOI10.1561/2200000003zbMATH Open1192.93010OpenAlexW2072054128MaRDI QIDQ3580907FDOQ3580907
Authors: Sridhar Mahadevan
Publication date: 14 August 2010
Published in: Foundations and Trends® in Machine Learning (Search for Journal in Brave)
Full work available at URL: https://semanticscholar.org/paper/481dc549e4fc8e0895fe98080dcd6fba138f7136
Recommendations
- Learning Control of Dynamical Systems Based on Markov Decision Processes: Research Frontiers and Outlooks
- Learning control of finite Markov chains with an explicit trade-off between estimation and control
- Reinforcement learning in robust Markov decision processes
- Reinforcement Learning for Sequential Decision and Optimal Control
- From reinforcement learning to optimal control: a unified framework for sequential decisions
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- Reinforcement learning of non-Markov decision processes
Markov decision processesDrazin inversegeneric algorithmgeneralized spectral inverse of Laplacianmachine learning frameworkrepresentation policy iterationsolution of sequential decision problems
Markov and semi-Markov decision processes (90C40) Hierarchical systems (93A13) Stochastic learning and adaptive control (93E35)
Cited In (8)
- \(L^\ast\)-based learning of Markov decision processes (extended version)
- Title not available (Why is that?)
- Low-Rank Representation of Reinforcement Learning Policies
- Markov Reward Models and Markov Decision Processes in Discrete and Continuous Time: Performance Evaluation and Optimization
- A new decision making model based on rank centrality for GDM with fuzzy preference relations
- Model-based Reinforcement Learning: A Survey
- Framing reinforcement learning from human reward: reward positivity, temporal discounting, episodicity, and performance
- Dimension reduction and its application to model-based exploration in continuous spaces
This page was built for publication: Learning Representation and Control in Markov Decision Processes: New Frontiers
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3580907)