Estimation and control in discounted stochastic dynamic programming
From MaRDI portal
Publication:3758580
Recommendations
Cites work
- scientific article; zbMATH DE number 3604231 (Why is no real title available?)
- scientific article; zbMATH DE number 3313523 (Why is no real title available?)
- scientific article; zbMATH DE number 3320878 (Why is no real title available?)
- A characterization of geometric ergodicity
- Adaptive control of Markov chains, I: Finite parameter set
- Bounds for the regret loss in dynamic programming under adaptive control
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion
- Estimation and control in Markov chains
- Markov decision processes and strongly excessive functions
- On Dynamic Programming with Unbounded Rewards
- Strongly consistent estimation in a controlled Markov renewal model
Cited in
(39)- The actor-critic algorithm as multi-time-scale stochastic approximation.
- Approximation, estimation and control of stochastic systems under a randomized discounted cost criterion
- Stability estimation of some Markov controlled processes
- Discretization procedures for adaptive Markov control processes
- Analysis of an identification algorithm arising in the adaptive estimation of Markov chains
- Ergodic control of multidimensional diffusions. II: Adaptive control
- scientific article; zbMATH DE number 179178 (Why is no real title available?)
- scientific article; zbMATH DE number 179177 (Why is no real title available?)
- scientific article; zbMATH DE number 4074842 (Why is no real title available?)
- Adaptive control of constrained Markov chains: Criteria and policies
- Estimation of the optimality deviation in discounted semi-Markov control models
- Markov control models with unknown random state-action-dependent discount factors
- scientific article; zbMATH DE number 4003938 (Why is no real title available?)
- Identification and control in the partially known Merton portfolio selection model
- Recursive adaptive control of Markov decision processes with the average reward criterion
- Nonparametric estimation and adaptive control in a class of finite Markov decision chains
- scientific article; zbMATH DE number 7232788 (Why is no real title available?)
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- Adaptive discounted control for piecewise deterministic Markov processes
- Adaptive average control for piecewise deterministic Markov processes
- Allocation of Control Points in Stochastic Dynamic-Programming Models
- Adaptive control of continuous-time linear stochastic systems with discounted cost criterion
- Nonparametric adaptive control of discounted stochastic systems with compact state space
- Adaptive control of diffusion processes with a discounted reward criterion
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
- Finite-state approximations for denumerable multidimensional state discounted Markov decision processes
- Nonstationary value-iteration and adaptive control of discounted semi- Markov processes
- Estimating the value of a discounted reward process
- Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion.
- Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion
- Discounting long run average growth in stochastic dynamic programs
- The Kumar-Becker-Lin scheme revisited
- Q-learning for Markov decision processes with a satisfiability criterion
- Controlled approximation of the value function in stochastic dynamic programming for multi-reservoir systems
- Controlling a Stochastic Process with Unknown Parameters
- Sensitivity of constrained Markov decision processes
- Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria
- Nonparametric adaptive control of discrete-time partially observable stochastic systems
- Density estimation and adaptive control of Markov processes: Average and discounted criteria
This page was built for publication: Estimation and control in discounted stochastic dynamic programming
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3758580)