Distributed dynamic programming

From MaRDI portal
Revision as of 23:39, 5 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:3955990

DOI10.1109/TAC.1982.1102980zbMath0493.49030OpenAlexW2119380668MaRDI QIDQ3955990

Dimitri P. Bertsekas

Publication date: 1982

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1109/tac.1982.1102980




Related Items (21)

Approximate policy iteration: a survey and some new methodsA bisection/successive approximation method for computing Gittins indicesParallel asynchronous label-correcting methods for shortest pathsAsynchronous gradient algorithms for a class of convex separable network flow problemsQ-learning and policy iteration algorithms for stochastic shortest path problemsModel-based average reward reinforcement learningReal-time dynamic programming for Markov decision processes with imprecise probabilitiesOn the stability of asynchronous iterative processesIndependent learning in stochastic gamesRobust topological policy iteration for infinite horizon bounded Markov decision processesA new class of asynchronous iterative algorithms with order intervalsComputationally efficient algorithms for on-line optimization of Markov decision processesExtended duality for nonlinear programmingA tutorial survey of reinforcement learningParallel decomposition of multistage stochastic programming problemsDistributed supply chain management using ant colony optimizationQuicker Convergence for Iterative Numerical Solutions to Stochastic Problems: Probabilistic Interpretations, Ordering Heuristics, and Parallel ProcessingRobust shortest path planning and semicontractive dynamic programmingDistributed asynchronous computation of fixed pointsSome aspects of parallel and distributed iterative algorithms - a surveyRobust event-driven interactions in cooperative multi-agent learning







This page was built for publication: Distributed dynamic programming