scientific article; zbMATH DE number 4045510
From MaRDI portal
Publication:3783097
zbMATH Open0641.90087MaRDI QIDQ3783097FDOQ3783097
Authors: Onésimo Hernández-Lerma
Publication date: 1985
Title of this publication is not available (Why is that?)
Recommendations
stochastic dynamic programmingasymptotic discount optimalityexpected total discounted rewardapproximation of dynamic programsnonstationary value- iterationadaptive control policiespolish state and control spaces
Dynamic programming (90C39) Stochastic programming (90C15) Special processes (60K99) Markov and semi-Markov decision processes (90C40)
Cited In (20)
- Title not available (Why is that?)
- Continuous dependence of stochastic control models on the noise distribution
- Discretization procedures for adaptive Markov control processes
- Suboptimal solutions to dynamic optimization problems via approximations of the policy functions
- Approximate policy optimization and adaptive control in regression models
- Adaptive policies for stochastic systems under a randomized discounted cost criterion
- Title not available (Why is that?)
- Pointwise approximations of discounted Markov decision processes to optimal policies
- Generalized polynomial approximations in Markovian decision processes
- Adaptive Markov control processes
- Title not available (Why is that?)
- Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
- Nonstationary value-iteration and adaptive control of discounted semi- Markov processes
- Adaptive control of discounted Markov decision chains
- Optimal adaptive policies for sequential allocation problems
- Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system
- An asynchronous stochastic approximation theorem and some applications
- Nonparametric adaptive control of discrete-time partially observable stochastic systems
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3783097)