Pages that link to "Item:Q2318167"
From MaRDI portal
The following pages link to Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies (Q2318167):
Displayed 4 items.
- Learning Machiavellian strategies for manipulation in Stackelberg security games (Q2122770) (← links)
- A Markovian Stackelberg game approach for computing an optimal dynamic mechanism (Q2245712) (← links)
- A Lyapunov approach for stable reinforcement learning (Q2675741) (← links)
- A Bayesian reinforcement learning approach in Markov games for computing near-optimal policies (Q6059222) (← links)