Pages that link to "Item:Q5060503"
From MaRDI portal
The following pages link to Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning (Q5060503):
Displaying 3 items.
- A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets (Q6138596)
- Off-policy evaluation in partially observed Markov decision processes under sequential ignorability (Q6183750)
- Projected state-action balancing weights for offline reinforcement learning (Q6183753)