Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (Q6152665)
From MaRDI portal
scientific article; zbMATH DE number 7803985
Language | Label | Description | Also known as |
---|---|---|---|
English | Pessimistic value iteration for multi-task data sharing in offline reinforcement learning |
scientific article; zbMATH DE number 7803985 |
Statements
Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (English)
0 references
13 February 2024
0 references
uncertainty quantification
0 references
data sharing
0 references
pessimistic value iteration
0 references
offline reinforcement learning
0 references
0 references