scientific article; zbMATH DE number 4031438
From MaRDI portal
Publication:3772003
zbMATH Open: 0633.90091; MaRDI QID: Q3772003; FDO: Q3772003
Authors: Onésimo Hernández-Lerma
Publication date: 1987
Full work available at URL: https://eudml.org/doc/28802
Title of this publication is not available
Keywords: unknown parameters; approximation procedures; value-iteration; average-reward controlled Markov processes; Borel state and control spaces; optimal adaptive policies
Cites Work
- Dynamic programming, Markov chains, and the method of successive approximations
- Title not available
- Stochastic optimal control. The discrete time case
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Title not available
- A Survey of Some Results in Stochastic Adaptive Control
- Title not available
- The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms
- Optimal Plans for Dynamic Programming Problems
- Estimation and control in Markov chains
- Title not available
- Title not available
- Nonstationary value-iteration and adaptive control of discounted semi-Markov processes
- Title not available
- Title not available
- Title not available
- Optimal adaptive control of priority assignment in queueing systems
- Adaptive control of service in queueing systems
- Nonstationary Markov decision problems with converging parameters
- Nonparametric adaptive control of discrete-time partially observable stochastic systems
- The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
- Title not available
- Title not available
- Adaptive Strategies for Certain Classes of Controlled Markov Processes
- Average-optimal adaptive policies in semi-Markov decision processes including an unknown parameter
- Title not available
- A modified form of the iterative method of dynamic programming
- Title not available
- A recursive algorithm in Markovian decision processes
Cited In (13)
- A unified approach to adaptive control of average reward Markov decision processes
- Discretization procedures for adaptive Markov control processes
- Estimation and control in finite Markov decision processes with the average reward criterion
- Average control of Markov decision processes with Feller transition probabilities and general action spaces
- Title not available
- Approximate receding horizon approach for Markov decision processes: average reward case
- Adaptive Markov control processes
- Adaptive average control for piecewise deterministic Markov processes
- Title not available
- Approximate gradient methods in policy-space optimization of Markov reward processes
- A forecast horizon and a stopping rule for general Markov decision processes
- Density estimation and adaptive control of Markov processes: Average and discounted criteria
- Title not available