The following pages link to (Q3623997):
Displayed 8 items.
- A new representation and associated algorithms for generalized planning (Q543617) (← links)
- Reducing reinforcement learning to KWIK online regression (Q616761) (← links)
- The factored policy-gradient planner (Q835832) (← links)
- Practical solution techniques for first-order MDPs (Q835833) (← links)
- APPSSAT: Approximate probabilistic planning using stochastic satisfiability (Q997058) (← links)
- Structured machine learning: the next ten years (Q1009285) (← links)
- Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130) (← links)
- Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies (Q3646118) (← links)