Pages that link to "Item:Q3768706"
From MaRDI portal
The following pages link to Learning algorithms for Markov decision processes (Q3768706):
Displayed 7 items.
- Adaptive control of Markov chains with local updates (Q913734) (← links)
- A unified approach to adaptive control of average reward Markov decision processes (Q1095048) (← links)
- Computationally efficient algorithms for on-line optimization of Markov decision processes (Q1190506) (← links)
- Statistical inference for a finite optimal stopping problem with unknown transition probabilities (Q1423869) (← links)
- Central limit theorem for the estimator of the value of an optimal stopping problem (Q2387145) (← links)
- (Q3971601) (← links)
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes (Q3984139) (← links)