Estimation and control in discounted stochastic dynamic programming

Queues and service in operations research (90B22) Dynamic programming (90C39) Queueing theory (aspects of probability theory) (60K25) Markov and semi-Markov decision processes (90C40)

Recommendations

scientific article; zbMATH DE number 4074842
scientific article; zbMATH DE number 4003938
Adaptive control of discounted Markov decision chains
Estimation and control in multichain processes
Nonparametric estimation and adaptive control in a class of finite Markov decision chains

Cites work

scientific article; zbMATH DE number 3604231 (Why is no real title available?)
scientific article; zbMATH DE number 3313523 (Why is no real title available?)
scientific article; zbMATH DE number 3320878 (Why is no real title available?)
A characterization of geometric ergodicity
Adaptive control of Markov chains, I: Finite parameter set
Bounds for the regret loss in dynamic programming under adaptive control
Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion
Estimation and control in Markov chains
Markov decision processes and strongly excessive functions
On Dynamic Programming with Unbounded Rewards
Strongly consistent estimation in a controlled Markov renewal model

Cited in

(41)

The actor-critic algorithm as multi-time-scale stochastic approximation.
Approximation, estimation and control of stochastic systems under a randomized discounted cost criterion
Stability estimation of some Markov controlled processes
Discretization procedures for adaptive Markov control processes
Analysis of an identification algorithm arising in the adaptive estimation of Markov chains
Ergodic control of multidimensional diffusions. II: Adaptive control
scientific article; zbMATH DE number 179178 (Why is no real title available?)
Minimum contrast estimators for piecewise deterministic Markov processes
scientific article; zbMATH DE number 179177 (Why is no real title available?)
scientific article; zbMATH DE number 4074842 (Why is no real title available?)
Adaptive control of constrained Markov chains: Criteria and policies
Estimation of the optimality deviation in discounted semi-Markov control models
Markov control models with unknown random state-action-dependent discount factors
scientific article; zbMATH DE number 4003938 (Why is no real title available?)
Identification and control in the partially known Merton portfolio selection model
Recursive adaptive control of Markov decision processes with the average reward criterion
Nonparametric estimation and adaptive control in a class of finite Markov decision chains
scientific article; zbMATH DE number 7232788 (Why is no real title available?)
Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
Adaptive discounted control for piecewise deterministic Markov processes
Adaptive average control for piecewise deterministic Markov processes
Allocation of Control Points in Stochastic Dynamic-Programming Models
Adaptive control of continuous-time linear stochastic systems with discounted cost criterion
Nonparametric adaptive control of discounted stochastic systems with compact state space
Adaptive control of diffusion processes with a discounted reward criterion
Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
Finite-state approximations for denumerable multidimensional state discounted Markov decision processes
Nonstationary value-iteration and adaptive control of discounted semi- Markov processes
Estimating the value of a discounted reward process
Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion.
Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion
Discounting long run average growth in stochastic dynamic programs
Non-stationary value iteration for adaptive average control of piecewise deterministic Markov processes
The Kumar-Becker-Lin scheme revisited
Q-learning for Markov decision processes with a satisfiability criterion
Controlled approximation of the value function in stochastic dynamic programming for multi-reservoir systems
Controlling a Stochastic Process with Unknown Parameters
Sensitivity of constrained Markov decision processes
Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria
Nonparametric adaptive control of discrete-time partially observable stochastic systems
Density estimation and adaptive control of Markov processes: Average and discounted criteria

This page was built for publication: Estimation and control in discounted stochastic dynamic programming

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3758580)