scientific article; zbMATH DE number 6148092
From MaRDI portal
Publication:4909777
zbMath1274.90474MaRDI QIDQ4909777
J. Adolfo Minjárez-Sosa, Evgueni I. Gordienko
Publication date: 21 March 2013
Full work available at URL: http://www.kybernetika.cz/content/1998/2
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
rate of convergenceunbounded costsdensity estimatorMarkov control processdiscounted asymptotic optimality
Markov processes: estimation; hidden Markov models (62M05) Markov and semi-Markov decision processes (90C40)
Related Items (15)
Bayesian estimation of the mean holding time in average semi-Markov control processes ⋮ Unnamed Item ⋮ Discrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approach ⋮ Average optimal strategies for zero-sum Markov games with poorly known payoff function on one side ⋮ Markov control models with unknown random state-action-dependent discount factors ⋮ Estimation of the Optimality Deviation in Discounted Semi-Markov Control Models ⋮ Semi-Markov control processes with unknown holding times distribution under an average cost criterion ⋮ Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria ⋮ Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion ⋮ Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs ⋮ Unnamed Item ⋮ Limiting optimal discounted-cost control of a class of time-varying stochastic systems ⋮ Approximation and mean field control of systems of large populations ⋮ Stability estimation of some Markov controlled processes ⋮ Partially observable Markov decision processes with partially observable random discount factors
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Adaptive control of discounted Markov decision chains
- Nonparametric adaptive control of discounted stochastic systems with compact state space
- On density estimation in the view of Kolmogorov's ideas in approximation theory
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution
- Measurable selection theorems for optimization problems
- Approximation of average cost optimal policies for general Markov decision processes with unbounded costs
- Adaptive Markov control processes
- On nearly self-optimizing strategies for a discrete-time uniformly ergodic adaptive model
- Density estimation and adaptive control of Markov processes: Average and discounted criteria
- Analysis of an adaptive control scheme for a partially observed controlled Markov chain
- Adaptive Strategies for Certain Classes of Controlled Markov Processes
- Estimation and control in discounted stochastic dynamic programming
- Minimizing the learning loss in adaptive control of Markov chains under the weak accessibility condition
- On Dynamic Programming with Unbounded Rewards
- Note—A Note on Dynamic Programming with Unbounded Rewards
- Infinite-horizon Markov control processes with undiscounted cost criteria: from average to overtaking optimality
- Estimation and control in Markov chains
- Average cost Markov control processes with weighted norms: value iteration
This page was built for publication: