scientific article
From MaRDI portal
Publication:3772003
zbMATH Open: 0633.90091; MaRDI QID: Q3772003; FDO: Q3772003
Publication date: 1987
Full work available at URL: https://eudml.org/doc/28802
Title of this publication is not available
unknown parameters; approximation procedures; value-iteration; average-reward controlled Markov processes; Borel state and control spaces; optimal adaptive policies
Cites Work
- Title not available (12 entries)
- Dynamic programming, Markov chains, and the method of successive approximations
- Stochastic optimal control. The discrete time case
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- A Survey of Some Results in Stochastic Adaptive Control
- The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms
- Optimal Plans for Dynamic Programming Problems
- Estimation and control in Markov chains
- Nonstationary value-iteration and adaptive control of discounted semi-Markov processes
- Optimal adaptive control of priority assignment in queueing systems
- Adaptive control of service in queueing systems
- Nonstationary Markov decision problems with converging parameters
- Nonparametric adaptive control of discrete-time partially observable stochastic systems
- The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
- Adaptive Strategies for Certain Classes of Controlled Markov Processes
- Average-optimal adaptive policies in semi-Markov decision processes including an unknown parameter
- A modified form of the iterative method of dynamic programming
- A recursive algorithm in Markovian decision processes
Cited In (5)
- Average control of Markov decision processes with Feller transition probabilities and general action spaces
- Adaptive Markov control processes
- Adaptive average control for piecewise deterministic Markov processes
- A forecast horizon and a stopping rule for general Markov decision processes
- Density estimation and adaptive control of Markov processes: Average and discounted criteria