Pessimistic value iteration for multi-task data sharing in offline reinforcement learning
From MaRDI portal
Publication:6152665
Recommendations
- Offline reinforcement learning with representations for actions
- Offline reinforcement learning with task hierarchies
- Exploiting action impact regularity and exogenous state variables for offline reinforcement learning
- Settling the sample complexity of model-based offline reinforcement learning
- Reliable off-policy evaluation for reinforcement learning
Cites work
Cited in
(2)
This page was built for publication: Pessimistic value iteration for multi-task data sharing in offline reinforcement learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6152665)