Pages that link to "Item:Q682295"
From MaRDI portal
The following pages link to Targeted sequential design for targeted learning inference of the optimal treatment rule and its mean reward (Q682295):
Displaying 4 items.
- Performance guarantees for policy learning (Q2227481) (← links)
- Statistical Inference for Online Decision Making via Stochastic Gradient Descent (Q4999148) (← links)
- Statistical Inference for Online Decision Making: In a Contextual Bandit Setting (Q5857145) (← links)
- Relaxing the i.i.d. assumption: adaptively minimax optimal regret via root-entropic regularization (Q6183761) (← links)