Pages that link to "Item:Q252797"
From MaRDI portal
The following pages link to Doubly robust policy evaluation and optimization (Q252797):
Displaying 19 items.
- Doubly robust policy evaluation and optimization (Q252797)
- Constrained Bayesian optimization with noisy experiments (Q1738149)
- Importance sampling in reinforcement learning with an estimated behavior policy (Q2051319)
- Optimal policy trees (Q2102338)
- Toward theoretical understandings of robust Markov decision processes: sample complexity and asymptotics (Q2112808)
- Batch policy learning in average reward Markov decision processes (Q2112817)
- PAC-Bayesian lifelong learning for multi-armed bandits (Q2134066)
- Augmented direct learning for conditional average treatment effect estimation with double robustness (Q2154959)
- Constructing effective personalized policies using counterfactual inference from biased data sets with many features (Q2425241)
- Doubly Robust Crowdsourcing (Q5026257)
- Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning (Q5060503)
- A Single-Index Model With a Surface-Link for Optimizing Individualized Dose Rules (Q5084453)
- (Q5148951)
- (Q5159398)
- (Q5214237)
- Nonparametric Causal Effects Based on Incremental Propensity Score Interventions (Q5231493)
- Learning When-to-Treat Policies (Q5857115)
- Selecting and Ranking Individualized Treatment Rules With Unmeasured Confounding (Q5857148)
- A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets (Q6138596)