Adaptive aggregation methods for infinite horizon dynamic programming

From MaRDI portal
Publication:3830830

DOI10.1109/9.24227zbMath0675.90089OpenAlexW2101167844MaRDI QIDQ3830830

Dimitri P. Bertsekas, David A. Castanon

Publication date: 1989

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1109/9.24227




Related Items

Adaptive differential dynamic programming for multiobjective optimal controlNumerical methods for controlled and uncontrolled multiplexing and queueing systemsAn incremental off-policy search in a model-free Markov decision process using a single sample pathApproximate dynamic programming with state aggregation applied to UAV perimeter patrolBlock-scaling of value-iteration for discounted Markov renewal programmingReplacement process decomposition for discounted Markov renewal programmingGeneric rank-one corrections for value iteration in Markovian decision problemsFeature-based methods for large scale dynamic programmingTwo adaptively stepped monotone algorithms for solving discounted dynamic programming equationsAdaptive aggregation for reinforcement learning in average reward Markov decision processesA variable neighborhood search based algorithm for finite-horizon Markov decision processesAbstraction and approximate decision-theoretic planning.Agent's actions as a classification criteria for the state space in a learning from rewards systemMultilevel Preconditioners for Temporal-Difference Learning Methods Related to Recommendation EnginesAuction algorithms for network flow problems: A tutorial introductionWhat you should know about approximate dynamic programmingNear optimization of dynamic systems by decomposition and aggregationStochastic dynamic programming with factored representationsBounded-parameter Markov decision processesAn aggregation-based approximate dynamic programming approach for the periodic review model with random yieldApproximate dynamic programming via iterated Bellman inequalitiesEstimating equilibrium probabilities for band diagonal Markov chains using aggregation and disaggregation techniquesSolving factored MDPs using non-homogeneous partitions