Convergence Properties of Policy Iteration

From MaRDI portal
Publication:4652513

DOI10.1137/S0363012902399824zbMath1134.90530OpenAlexW2061508005WikidataQ56813305 ScholiaQ56813305MaRDI QIDQ4652513

John Rust, Manuel S. Santos

Publication date: 28 February 2005

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1137/s0363012902399824



Lua error in Module:PublicationMSCList at line 37: attempt to index local 'msc_result' (a nil value).


Related Items (28)

Multilevel techniques for the solution of HJB minimum-time control problemsUndiscounted control policy generation for continuous-valued optimal control by approximate dynamic programmingRates of convergence for the policy iteration method for mean field games systemsPenalty and penalty-like methods for nonlinear HJB PDEsEnvelope condition method with an application to default risk modelsOptimal investment strategies for pension funds with regulation-conform dynamic pension payment management in the absence of guaranteesOptimal polynomial feedback laws for finite horizon control problemsPolicy iteration method for time-dependent mean field games systems with non-separable HamiltoniansA power penalty method for discrete HJB equationsA note on generalized second-order value iteration in Markov decision processesExponential Convergence and Stability of Howard's Policy Improvement Algorithm for Controlled DiffusionsA semi-Lagrangian scheme for a modified version of the Hughes' model for Pedestrian flowOptimal investment-consumption problem: post-retirement with minimum guaranteeOptimal control of Boolean control networks with average cost: a policy iteration approachFeedback control of parametrized PDEs via model order reduction and dynamic programming principleA semi-Lagrangian algorithm in policy space for hybrid optimal control problemsOptimal consumption under uncertainty, liquidity constraints, and bounded rationalityDomain decomposition based parallel Howard's algorithmPolicy iteration for continuous-time average reward Markov decision processes in Polish spacesApproximation of two-person zero-sum continuous-time Markov games with average payoff criterionA neural network-based policy iteration algorithm with global \(H^2\)-superlinear convergence for stochastic games on domainsPower penalty method for solving HJB equations arising from financeThe primal-dual active set method for a class of nonlinear problems with \(T\)-monotone operatorsA policy iteration method for mean field gamesA perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costsOptimal price-threshold control for battery operation with aging phenomenon: a quasiconvex optimization approachContinuous vs. discrete time: some computational insightsAn Accelerated Value/Policy Iteration Scheme for Optimal Control Problems and Games




This page was built for publication: Convergence Properties of Policy Iteration