Reliable off-policy evaluation for reinforcement learning (Q6579655)
From MaRDI portal
scientific article; zbMATH DE number 7887720
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Reliable off-policy evaluation for reinforcement learning | scientific article; zbMATH DE number 7887720 | |
Statements
Reliable off-policy evaluation for reinforcement learning (English)
25 July 2024
uncertainty quantification
reinforcement learning
Wasserstein robust optimization
0.8213511109352112
0.8125923275947571
0.7769107222557068
0.7709460854530334