The following pages link to (Q2933945):
- Doubly robust policy evaluation and optimization (Q252797)
- Recovering Markov models from closed-loop data (Q1737809)
- Variational learning from implicit bandit feedback (Q2071347)
- Lessons on off-policy methods from a notification component of a chatbot (Q2071403)
- On obtaining sparse semantic solutions for inverse problems, control, and neural network training (Q2132578)
- Augmented direct learning for conditional average treatment effect estimation with double robustness (Q2154959)
- Invariance, causality and robustness (Q2218071)
- Constructing effective personalized policies using counterfactual inference from biased data sets with many features (Q2425241)
- Computing Large Market Equilibria Using Abstractions (Q5031015)