Pages that link to "Item:Q616761"
From MaRDI portal
The following pages link to Reducing reinforcement learning to KWIK online regression (Q616761):
Displayed 4 items.
- Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm (Q300040) (← links)
- Knows what it knows: a framework for self-aware learning (Q413843) (← links)
- Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains (Q460616) (← links)
- (Q5214215) (← links)