Pages that link to "Item:Q1945130"
From MaRDI portal
The following pages link to Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130):
Displaying 9 items.
- Preferences in artificial intelligence (Q314443) (← links)
- Global optimization based on active preference learning with radial basis functions (Q2051251) (← links)
- A one-bit, comparison-based gradient estimator (Q2155805) (← links)
- Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm (Q2514758) (← links)
- (Q4637066) (← links)
- Active Inference: Demystified and Compared (Q5004319) (← links)
- Deterministic policies based on maximum regrets in MDPs with imprecise rewards (Q5069649) (← links)
- Reinforcement learning (Q6602227) (← links)
- Preference learning and multiple criteria decision aiding: differences, commonalities, and synergies. II (Q6614639) (← links)