Pages that link to "Item:Q5857153"

From MaRDI portal

The following pages link to Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (Q5857153):

Displayed 7 items.

Batch policy learning in average reward Markov decision processes (Q2112817)
A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets (Q6138596)
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process (Q6153991)
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning (Q6154019)
Statistical Learning for Individualized Asset Allocation (Q6154020)
Off-policy evaluation in partially observed Markov decision processes under sequential ignorability (Q6183750)
Projected state-action balancing weights for offline reinforcement learning (Q6183753)
