Projected state-action balancing weights for offline reinforcement learning (Q6183753)
From MaRDI portal
scientific article; zbMATH DE number 7783513
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Projected state-action balancing weights for offline reinforcement learning | scientific article; zbMATH DE number 7783513 | |
Statements
Projected state-action balancing weights for offline reinforcement learning (English)
4 January 2024
infinite horizons
Markov decision process
policy evaluation
reinforcement learning
0.7935605049133301
0.7855165600776672
0.7769107222557068
0.7653833627700806
0.7500125169754028