Simulation-based optimization of Markov decision processes: an empirical process theory approach
Publication: 608432
DOI: 10.1016/j.automatica.2010.05.021
zbMath: 1204.93132
OpenAlex: W2071767680
MaRDI QID: Q608432
Publication date: 25 November 2010
Published in: Automatica
Full work available at URL: https://doi.org/10.1016/j.automatica.2010.05.021
MSC classification: Learning and adaptive systems in artificial intelligence (68T05); Stochastic programming (90C15); Markov chains (discrete-time Markov processes on discrete state spaces) (60J10); Optimal stochastic control (93E20)
Related Items (3)
- Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms
- Relevant states and memory in Markov chain bootstrapping and simulation
- Empirical Dynamic Programming
Cites Work
- Simulation-based algorithms for Markov decision processes.
- Decision theoretic generalizations of the PAC model for neural net and other learning applications
- Learning and generalisation. With applications to neural networks.
- Approximate gradient methods in policy-space optimization of Markov reward processes
- Concentration of measure and isoperimetric inequalities in product spaces
- Simulation-based Uniform Value Function Estimates of Markov Decision Processes
- Necessary and Sufficient Conditions for the Uniform Convergence of Means to their Expectations
- Uniform Central Limit Theorems
- Scale-sensitive dimensions, uniform convergence, and learnability
- Rate of Convergence of Empirical Measures and Costs in Controlled Markov Chains and Transient Optimality
- Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems
- Neural Network Learning
- Dynamic Programming Conditions for Partially Observable Stochastic Systems
- On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies