The following pages link to (Q4888202):
Displayed 4 items.
- The convergence of value iteration in average cost Markov decision chains (Q2564235)
- Average optimality for Markov decision processes in Borel spaces: a new condition and approach (Q3410916)
- Denumerable controlled Markov chains with average reward criterion: Sample path optimality (Q4698121)
- Asymptotic behavior of the value functions of discrete-time discounted optimal control (Q5947269)