Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (Q6152665)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Pessimistic value iteration for multi-task data sharing in offline reinforcement learning |
scientific article; zbMATH DE number 7803985
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Pessimistic value iteration for multi-task data sharing in offline reinforcement learning |
scientific article; zbMATH DE number 7803985 |
Statements
Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (English)
0 references
13 February 2024
0 references
uncertainty quantification
0 references
data sharing
0 references
pessimistic value iteration
0 references
offline reinforcement learning
0 references
0.7438877820968628
0 references
0.7111769914627075
0 references
0.7106821537017822
0 references
0.7041367888450623
0 references
0.6924082040786743
0 references