Q-learning for distributionally robust Markov decision processes
From MaRDI portal
Publication:5153603
Recommendations
- Distributionally robust Markov decision processes
- Distributionally robust optimization for sequential decision-making
- Distributionally Robust Markov Decision Processes and Their Connection to Risk Measures
- Reinforcement learning in robust Markov decision processes
- Distributionally robust partially observable Markov decision process with moment-based ambiguity
Cites work
- scientific article; zbMATH DE number 3126094 (Why is no real title available?)
- scientific article; zbMATH DE number 1325008 (Why is no real title available?)
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 1965513 (Why is no real title available?)
- scientific article; zbMATH DE number 3320878 (Why is no real title available?)
- Ambiguity Aversion, Robustness, and the Variational Representation of Preferences
- Ambiguity in asset pricing and portfolio choice: a review of the literature
- Distributionally robust Markov decision processes
- Markov decision processes with applications to finance.
- Mathematical risk analysis. Dependence, risk bounds, optimal allocations and portfolios
- Minimax Control of Discrete-Time Stochastic Systems
- Premiums and reserves, adjusted by distortions
- Quantitative risk management. Concepts, techniques and tools
- Robust Dynamic Programming
- Robust Markov Decision Processes
- Robust Markov control processes
Cited in
(6)- Markov decision processes under model uncertainty
- Prospect-theoretic Q-learning
- Q-learning for Markov decision processes with a satisfiability criterion
- Robust \(Q\)-learning algorithm for Markov decision processes under Wasserstein uncertainty
- Speedy categorical distributional reinforcement learning and complexity analysis
- A dynamical neural network approach for distributionally robust chance-constrained Markov decision process
This page was built for publication: Q-learning for distributionally robust Markov decision processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5153603)