Single sample path-based optimization of Markov chains

Publication:1289394

DOI: 10.1023/A:1022634422482; zbMath: 0949.90092; OpenAlex: W1497026160; MaRDI QID: Q1289394

Publication date: 28 November 2000

Published in: Journal of Optimization Theory and Applications

Full work available at URL: https://doi.org/10.1023/a:1022634422482

zbMATH Keywords

Markov decision processes; perturbation analysis; on-line optimization; performance potentials

Mathematics Subject Classification ID

Stochastic programming (90C15); Markov and semi-Markov decision processes (90C40)

Related Items (7)

A time aggregation approach to Markov decision processes ⋮ Potential-based least-squares policy iteration for a parameterized feedback control system ⋮ The control of a two-level Markov decision process by time aggregation ⋮ Generalized estimates for performance sensitivities of stochastic systems ⋮ Temporal difference-based policy iteration for optimal control of stochastic systems ⋮ A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases ⋮ Basic ideas for event-based optimization of Markov systems
