Pages that link to "Item:Q1136706"
From MaRDI portal
The following pages link to Nonstationary Markov decision problems with converging parameters (Q1136706):
Displaying 22 items.
- Adaptive control of discounted Markov decision chains (Q796461) (← links)
- Existence of optimal policy for time non-homogeneous discounted Markovian decision programming (Q811418) (← links)
- Nonstationary value-iteration and adaptive control of discounted semi- Markov processes (Q1068732) (← links)
- Monotone value iteration for discounted finite Markov decision processes (Q1076618) (← links)
- Finite-state approximations for denumerable multidimensional state discounted Markov decision processes (Q1079512) (← links)
- A unified approach to adaptive control of average reward Markov decision processes (Q1095048) (← links)
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution (Q1099125) (← links)
- Nonparametric adaptive control of discrete-time partially observable stochastic systems (Q1122548) (← links)
- Discretization procedures for adaptive Markov control processes (Q1123872) (← links)
- The rate of convergence for backwards products of a convergent sequence of finite Markov matrices (Q1151669) (← links)
- Nonparametric estimation and adaptive control in a class of finite Markov decision chains (Q1174701) (← links)
- Computationally efficient algorithms for on-line optimization of Markov decision processes (Q1190506) (← links)
- On confidence intervals from simulation of finite Markov chains (Q1374692) (← links)
- Recursive adaptive control of Markov decision processes with the average reward criterion (Q2276895) (← links)
- Central limit theorem for the estimator of the value of an optimal stopping problem (Q2387145) (← links)
- Computing Optimal Policies for Markovian Decision Processes Using Simulation (Q2808287) (← links)
- A value-iteration scheme for undiscounted multichain Markov renewal programs (Q3221782) (← links)
- (Q3772003) (← links)
- (Q3971601) (← links)
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes (Q3984139) (← links)
- Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion (Q5476137) (← links)
- Adaptive discounted control for piecewise deterministic Markov processes (Q6136351) (← links)