Pages that link to "Item:Q4027786"
From MaRDI portal
The following pages link to The Optimal Reward Operator in Negative Dynamic Programming (Q4027786):
- Ashok Prasad Maitra (1938-2008) (Q2431006)
- Finitely Additive Dynamic Programming (Q2800365)
- A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies (Q3465941)
- Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies (Q5130921)
- On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes (Q5502179)