Pages that link to "Item:Q3758580"
From MaRDI portal
The following pages link to Estimation and control in discounted stochastic dynamic programming (Q3758580):
Displaying 29 items.
- Ergodic control of multidimensional diffusions. II: Adaptive control (Q583162) (← links)
- Markov control models with unknown random state-action-dependent discount factors (Q889107) (← links)
- Adaptive control of continuous-time linear stochastic systems with discounted cost criterion (Q912054) (← links)
- Nonparametric adaptive control of discounted stochastic systems with compact state space (Q913742) (← links)
- Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion (Q1021247) (← links)
- Nonstationary value-iteration and adaptive control of discounted semi- Markov processes (Q1068732) (← links)
- Finite-state approximations for denumerable multidimensional state discounted Markov decision processes (Q1079512) (← links)
- Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution (Q1099125) (← links)
- Nonparametric adaptive control of discrete-time partially observable stochastic systems (Q1122548) (← links)
- Discretization procedures for adaptive Markov control processes (Q1123872) (← links)
- Adaptive control of constrained Markov chains: Criteria and policies (Q1174698) (← links)
- Nonparametric estimation and adaptive control in a class of finite Markov decision chains (Q1174701) (← links)
- Sensitivity of constrained Markov decision processes (Q1176864) (← links)
- Analysis of an identification algorithm arising in the adaptive estimation of Markov chains (Q1262282) (← links)
- Identification and control in the partially known Merton portfolio selection model (Q1321343) (← links)
- Q-learning for Markov decision processes with a satisfiability criterion (Q1749413) (← links)
- The Kumar-Becker-Lin scheme revisited (Q1824599) (← links)
- Stability estimation of some Markov controlled processes (Q2111840) (← links)
- Recursive adaptive control of Markov decision processes with the average reward criterion (Q2276895) (← links)
- Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria (Q2474553) (← links)
- Density estimation and adaptive control of Markov processes: Average and discounted criteria (Q2639029) (← links)
- (Q3303461) (← links)
- Adaptive control of diffusion processes with a discounted reward criterion (Q3386883) (← links)
- (Q3552450) (← links)
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes (Q3984139) (← links)
- (Q4039208) (← links)
- (Q4909777) (← links)
- The actor-critic algorithm as multi-time-scale stochastic approximation. (Q5955801) (← links)
- Adaptive discounted control for piecewise deterministic Markov processes (Q6136351) (← links)