Incremental Value Iteration for Time-Aggregated Markov-Decision Processes
From MaRDI portal
Publication:5282301
Cited in (7)
- A numerical study of Markov decision process algorithms for multi-component replacement problems
- A unified approach to time-aggregated Markov decision processes
- Event-based optimization approach for solving stochastic decision problems with probabilistic constraint
- Time aggregated Markov decision processes via standard dynamic programming
- Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm
- Approximate Value Iteration with Temporally Extended Actions
- Revenue management for operations with urgent orders