Pages that link to "Item:Q4173218"
From MaRDI portal
The following pages link to Modified Policy Iteration Algorithms for Discounted Markov Decision Problems (Q4173218):
Displayed 30 items.
- Design and evaluation of norm-aware agents based on normative Markov decision processes (Q324660) (← links)
- (Approximate) iterated successive approximations algorithm for sequential decision processes (Q378751) (← links)
- A survey of solution techniques for the partially observed Markov decision process (Q804478) (← links)
- Contingent planning under uncertainty via stochastic satisfiability (Q814473) (← links)
- Truncated policy iteration methods (Q1060136) (← links)
- Reward revision and the average reward Markov decision process (Q1097179) (← links)
- A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes (Q1266643) (← links)
- Abstraction and approximate decision-theoretic planning. (Q1399130) (← links)
- Stochastic dynamic programming with factored representations (Q1583230) (← links)
- Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming (Q1785275) (← links)
- Generic rank-one corrections for value iteration in Markovian decision problems (Q1905071) (← links)
- Dynamic programming and value-function approximation in sequential decision problems: error analysis and numerical results (Q1949593) (← links)
- A note on policy algorithms for discounted Markov decision problems (Q1969768) (← links)
- Applications of Markov chain approximation methods to optimal control problems in economics (Q2097976) (← links)
- Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems (Q2293273) (← links)
- Admission control in a two-class loss system with periodically varying parameters and abandonments (Q2302276) (← links)
- Accelerated modified policy iteration algorithms for Markov decision processes (Q2391867) (← links)
- Approximate dynamic programming for stochastic \(N\)-stage optimization with application to optimal consumption under uncertainty (Q2450902) (← links)
- On integral generalized policy iteration for continuous-time linear quadratic regulations (Q2628425) (← links)
- Learning classifier systems: a survey (Q2642997) (← links)
- Complexity bounds for approximately solving discounted MDPs by value iterations (Q2661516) (← links)
- Stability and monotone convergence of generalised policy iteration for discrete-time linear quadratic regulations (Q2792734) (← links)
- (Q3585656) (← links)
- Improved iterative computation of the expected discounted return in Markov and semi-Markov chains (Q3885559) (← links)
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes (Q3984139) (← links)
- A method of bisection for discounted Markov decision problems (Q4199858) (← links)
- A semi-Lagrangian algorithm in policy space for hybrid optimal control problems (Q4646817) (← links)
- (Q5053310) (← links)
- DYNAMIC CONTROL OF A SINGLE-SERVER SYSTEM WHEN JOBS CHANGE STATUS (Q5242840) (← links)
- Markov decision processes (Q5904001) (← links)