The variational calculus and approximation in policy space for Markovian decision processes
From MaRDI portal
Publication:1068009
DOI10.1016/0022-247X(85)90197-0zbMATH Open0579.90097MaRDI QIDQ1068009FDOQ1068009
Authors: Paul J. Schweitzer
Publication date: 1985
Published in: Journal of Mathematical Analysis and Applications (Search for Journal in Brave)
Recommendations
- Variational characterizations in Markov decision processes
- scientific article; zbMATH DE number 2159172
- Variance minimization of parameterized Markov decision processes
- Markov Decision Processes with Variance Minimization: A New Condition and Approach
- Optimization of Markov decision processes under the variance criterion
- Variance minimization for continuous-time Markov decision processes: two approaches
- On Finding Optimal Policies for Markov Decision Chains: A Unifying Framework for Mean-Variance-Tradeoffs
- scientific article; zbMATH DE number 4037634
- Computational approaches to variance-penalized Markov decision processes
- Generalized polynomial approximations in Markovian decision processes
Numerical optimization and variational techniques (65K10) Markov and semi-Markov decision processes (90C40)
Cites Work
- Title not available (Why is that?)
- Title not available (Why is that?)
- Perturbation theory and finite Markov chains
- Markov Renewal Programs with Small Interest Rates
- Title not available (Why is that?)
- The Functional Equations of Undiscounted Markov Renewal Programming
- Markov-Renewal Programming. I: Formulation, Finite Return Models
- Title not available (Why is that?)
- Discrete Dynamic Programming with Sensitive Discount Optimality Criteria
- On Finding Optimal Policies in Discrete Dynamic Programming with No Discounting
- On the Convergence of Policy Iteration in Stationary Dynamic Programming
- Perturbation Theory and Undiscounted Markov Renewal Programming
- Title not available (Why is that?)
- Algorithms for Stochastic Games with Geometrical Interpretation
Cited In (6)
- Policy Improvement and the Newton-Raphson Algorithm
- Successive Approximation Methods for Solving Nested Functional Equations in Markov Decision Problems
- Technical note -- cyclic variables and Markov decision processes
- Variational characterizations in Markov decision processes
- Generalized polynomial approximations in Markovian decision processes
- Technical Note—Successive Approximations in Value Determination for a Markov Decision Process
This page was built for publication: The variational calculus and approximation in policy space for Markovian decision processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1068009)