On State Aggregation to Approximate Complex Value Functions in Large-Scale Markov Decision Processes
From MaRDI portal
Publication:5347606
Recommendations
- Performance Loss Bounds for Approximate Value Iteration with State Aggregation
- Extreme state aggregation beyond MDPs
- Extreme state aggregation beyond Markov decision processes
- Approximation of Markov decision processes with general state space
- Aggregation of the policy iteration method for nearly completely decomposable Markov chains
- Pseudometrics for State Aggregation in Average Reward Markov Decision Processes
- Approximate policy iteration for Markov decision processes via quantitative adaptive aggregations
- Approximating Markov decision processes using expected state transitions
- Finite-state approximations for denumerable multidimensional state discounted Markov decision processes
- scientific article; zbMATH DE number 1509479
Cited in
(7)- Parameterized Markov decision process and its application to service rate control
- Revenue management for operations with urgent orders
- Learning to agree over large state spaces
- Power and delay optimisation in multi-hop wireless networks
- Modified iterative aggregation procedure for maintenance optimisation of multi-component systems with failure interaction
- FUZZY STATE AGGREGATION AND POLICY HILL CLIMBING FOR STOCHASTIC ENVIRONMENTS
- Control-limit policies for a class of stopping time problems with termination restrictions
This page was built for publication: On State Aggregation to Approximate Complex Value Functions in Large-Scale Markov Decision Processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5347606)