Convergence Properties of Policy Iteration

From MaRDI portal

Publication:4652513

Jump to:navigation, search

DOI10.1137/S0363012902399824zbMath1134.90530OpenAlexW2061508005WikidataQ56813305 ScholiaQ56813305MaRDI QIDQ4652513

John Rust, Manuel S. Santos

Publication date: 28 February 2005

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1137/s0363012902399824

zbMATH Keywords

complexity policy iteration computational cost method of successive approximations quadratic and superlinear convergence

Mathematics Subject Classification ID

Lua error in Module:PublicationMSCList at line 37: attempt to index local 'msc_result' (a nil value).

Related Items (28)

Multilevel techniques for the solution of HJB minimum-time control problems ⋮ Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming ⋮ Rates of convergence for the policy iteration method for mean field games systems ⋮ Penalty and penalty-like methods for nonlinear HJB PDEs ⋮ Envelope condition method with an application to default risk models ⋮ Optimal investment strategies for pension funds with regulation-conform dynamic pension payment management in the absence of guarantees ⋮ Optimal polynomial feedback laws for finite horizon control problems ⋮ Policy iteration method for time-dependent mean field games systems with non-separable Hamiltonians ⋮ A power penalty method for discrete HJB equations ⋮ A note on generalized second-order value iteration in Markov decision processes ⋮ Exponential Convergence and Stability of Howard's Policy Improvement Algorithm for Controlled Diffusions ⋮ A semi-Lagrangian scheme for a modified version of the Hughes' model for Pedestrian flow ⋮ Optimal investment-consumption problem: post-retirement with minimum guarantee ⋮ Optimal control of Boolean control networks with average cost: a policy iteration approach ⋮ Feedback control of parametrized PDEs via model order reduction and dynamic programming principle ⋮ A semi-Lagrangian algorithm in policy space for hybrid optimal control problems ⋮ Optimal consumption under uncertainty, liquidity constraints, and bounded rationality ⋮ Domain decomposition based parallel Howard's algorithm ⋮ Policy iteration for continuous-time average reward Markov decision processes in Polish spaces ⋮ Approximation of two-person zero-sum continuous-time Markov games with average payoff criterion ⋮ A neural network-based policy iteration algorithm with global \(H^2\)-superlinear convergence for stochastic games on domains ⋮ Power penalty method for solving HJB equations arising from finance ⋮ The primal-dual active set method for a class of nonlinear problems with \(T\)-monotone operators ⋮ A policy iteration method for mean field games ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs ⋮ Optimal price-threshold control for battery operation with aging phenomenon: a quasiconvex optimization approach ⋮ Continuous vs. discrete time: some computational insights ⋮ An Accelerated Value/Policy Iteration Scheme for Optimal Control Problems and Games

This page was built for publication: Convergence Properties of Policy Iteration

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4652513&oldid=18848729"

Pages with script errors