Projected state-action balancing weights for offline reinforcement learning (Q6183753)
From MaRDI portal
scientific article; zbMATH DE number 7783513
Language | Label | Description | Also known as |
---|---|---|---|
English | Projected state-action balancing weights for offline reinforcement learning |
scientific article; zbMATH DE number 7783513 |
Statements
Projected state-action balancing weights for offline reinforcement learning (English)
0 references
4 January 2024
0 references
infinite horizons
0 references
Markov decision process
0 references
policy evaluation
0 references
reinforcement learning
0 references
0 references
0 references
0 references
0 references
0 references