Convergence Properties of Policy Iteration
From MaRDI portal
Publication:4652513
DOI10.1137/S0363012902399824zbMath1134.90530OpenAlexW2061508005WikidataQ56813305 ScholiaQ56813305MaRDI QIDQ4652513
Publication date: 28 February 2005
Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1137/s0363012902399824
complexitypolicy iterationcomputational costmethod of successive approximationsquadratic and superlinear convergence
Lua error in Module:PublicationMSCList at line 37: attempt to index local 'msc_result' (a nil value).
Related Items (28)
Multilevel techniques for the solution of HJB minimum-time control problems ⋮ Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming ⋮ Rates of convergence for the policy iteration method for mean field games systems ⋮ Penalty and penalty-like methods for nonlinear HJB PDEs ⋮ Envelope condition method with an application to default risk models ⋮ Optimal investment strategies for pension funds with regulation-conform dynamic pension payment management in the absence of guarantees ⋮ Optimal polynomial feedback laws for finite horizon control problems ⋮ Policy iteration method for time-dependent mean field games systems with non-separable Hamiltonians ⋮ A power penalty method for discrete HJB equations ⋮ A note on generalized second-order value iteration in Markov decision processes ⋮ Exponential Convergence and Stability of Howard's Policy Improvement Algorithm for Controlled Diffusions ⋮ A semi-Lagrangian scheme for a modified version of the Hughes' model for Pedestrian flow ⋮ Optimal investment-consumption problem: post-retirement with minimum guarantee ⋮ Optimal control of Boolean control networks with average cost: a policy iteration approach ⋮ Feedback control of parametrized PDEs via model order reduction and dynamic programming principle ⋮ A semi-Lagrangian algorithm in policy space for hybrid optimal control problems ⋮ Optimal consumption under uncertainty, liquidity constraints, and bounded rationality ⋮ Domain decomposition based parallel Howard's algorithm ⋮ Policy iteration for continuous-time average reward Markov decision processes in Polish spaces ⋮ Approximation of two-person zero-sum continuous-time Markov games with average payoff criterion ⋮ A neural network-based policy iteration algorithm with global \(H^2\)-superlinear convergence for stochastic games on domains ⋮ Power penalty method for solving HJB equations arising from finance ⋮ The primal-dual active set method for a class of nonlinear problems with \(T\)-monotone operators ⋮ A policy iteration method for mean field games ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs ⋮ Optimal price-threshold control for battery operation with aging phenomenon: a quasiconvex optimization approach ⋮ Continuous vs. discrete time: some computational insights ⋮ An Accelerated Value/Policy Iteration Scheme for Optimal Control Problems and Games
This page was built for publication: Convergence Properties of Policy Iteration