Mechanizing soundness of off-policy evaluation (Q6572575)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Mechanizing soundness of off-policy evaluation |
scientific article; zbMATH DE number 7881145
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Mechanizing soundness of off-policy evaluation |
scientific article; zbMATH DE number 7881145 |
Statements
Mechanizing soundness of off-policy evaluation (English)
0 references
15 July 2024
0 references
formal methods
0 references
HOL4
0 references
reinforcement learning
0 references
off-policy evaluation
0 references
concentration inequality
0 references
Hoeffding
0 references