Adaptive control of discounted Markov decision chains
From MaRDI portal
(Redirected from Publication:796461)
Recommendations
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- scientific article; zbMATH DE number 4045510
- Nonstationary value-iteration and adaptive control of discounted semi- Markov processes
- scientific article; zbMATH DE number 4123661
- scientific article; zbMATH DE number 970511
Cites work
- scientific article; zbMATH DE number 3843637 (Why is no real title available?)
- scientific article; zbMATH DE number 3686524 (Why is no real title available?)
- scientific article; zbMATH DE number 3579744 (Why is no real title available?)
- scientific article; zbMATH DE number 3320878 (Why is no real title available?)
- scientific article; zbMATH DE number 3338194 (Why is no real title available?)
- scientific article; zbMATH DE number 3378668 (Why is no real title available?)
- Adaptive control of service in queueing systems
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Convergence analysis of parametric identification methods
- Dynamic programming and stochastic control
- Estimation and control in Markov chains
- Nonstationary Markov decision problems with converging parameters
- Optimal adaptive control of priority assignment in queueing systems
- Strongly consistent estimation in a controlled Markov renewal model
- The average-optimal adaptive control of a Markov renewal model in presence of an unknown parameter
Cited in
(33)- Minimizing the learning loss in adaptive control of Markov chains under the weak accessibility condition
- Estimation and control in discounted stochastic dynamic programming
- scientific article; zbMATH DE number 4112513 (Why is no real title available?)
- A unified approach to adaptive control of average reward Markov decision processes
- Adaptive control of average Markov decision chains under the Lyapunov stability condition
- Stability estimation of some Markov controlled processes
- Discretization procedures for adaptive Markov control processes
- Adaptive policies for stochastic systems under a randomized discounted cost criterion
- scientific article; zbMATH DE number 4045510 (Why is no real title available?)
- scientific article; zbMATH DE number 4003941 (Why is no real title available?)
- Adaptive control of constrained Markov chains: Criteria and policies
- Finite-state approximations for denumerable state discounted Markov decision processes
- Adaptive Markov control processes
- scientific article; zbMATH DE number 4003938 (Why is no real title available?)
- Optimal cost and policy for a Markovian replacement problem
- Identification and control in the partially known Merton portfolio selection model
- Recursive adaptive control of Markov decision processes with the average reward criterion
- Nonparametric estimation and adaptive control in a class of finite Markov decision chains
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- Adaptive discounted control for piecewise deterministic Markov processes
- Adaptive average control for piecewise deterministic Markov processes
- Nonparametric adaptive control of discounted stochastic systems with compact state space
- Adaptive control of diffusion processes with a discounted reward criterion
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
- Finite-state approximations for denumerable multidimensional state discounted Markov decision processes
- Nonstationary value-iteration and adaptive control of discounted semi- Markov processes
- Comparing Policies in Markov Decision Processes: Mandl's Lemma Revisited
- Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion.
- Adaptive control of Markov processes with incomplete state information and unknown parameters
- Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria
- Nonparametric adaptive control of discrete-time partially observable stochastic systems
- Density estimation and adaptive control of Markov processes: Average and discounted criteria
- Statistical inference for a finite optimal stopping problem with unknown transition probabilities
This page was built for publication: Adaptive control of discounted Markov decision chains
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q796461)