Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (Q5857153)
From MaRDI portal
scientific article; zbMATH DE number 7329750
Language | Label | Description | Also known as |
---|---|---|---|
English | Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health | scientific article; zbMATH DE number 7329750 | |
Statements
Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (English)
30 March 2021
Markov decision process
policy evaluation
reinforcement learning
sequential decision making