Pessimistic value iteration for multi-task data sharing in offline reinforcement learning

From MaRDI portal
Publication:6152665












This page was built for publication: Pessimistic value iteration for multi-task data sharing in offline reinforcement learning

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6152665)