Pages that link to "Item:Q1945130"
From MaRDI portal
The following pages link to Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130):
- Preferences in artificial intelligence (Q314443)
- Global optimization based on active preference learning with radial basis functions (Q2051251)
- A one-bit, comparison-based gradient estimator (Q2155805)
- Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm (Q2514758)
- Active Inference: Demystified and Compared (Q5004319)