Pessimistic value iteration for multi-task data sharing in offline reinforcement learning
From MaRDI portal
Publication:6152665
DOI10.1016/J.ARTINT.2023.104048MaRDI QIDQ6152665FDOQ6152665
Authors:
Publication date: 13 February 2024
Published in: Artificial Intelligence (Search for Journal in Brave)
Recommendations
- Offline reinforcement learning with representations for actions
- Offline reinforcement learning with task hierarchies
- Exploiting action impact regularity and exogenous state variables for offline reinforcement learning
- Settling the sample complexity of model-based offline reinforcement learning
- Reliable off-policy evaluation for reinforcement learning
Cites Work
Cited In (2)
This page was built for publication: Pessimistic value iteration for multi-task data sharing in offline reinforcement learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6152665)