Pages that link to "Item:Q2151247"
From MaRDI portal
The following pages link to Value iteration for long-run average reward in Markov decision processes (Q2151247):
Displaying 6 items.
- Economic design of memory-type control charts: the fallacy of the formula proposed by Lorenzen and Vance (1986) (Q1995869) (← links)
- Multi-objective optimization of long-run average and total rewards (Q2044201) (← links)
- Markov automata with multiple objectives (Q2151241) (← links)
- Value iteration for simple stochastic games: stopping criterion and learning algorithm (Q2672267) (← links)
- (Q5875366) (← links)
- (Q5875369) (← links)