Pages that link to "Item:Q2834459"

From MaRDI portal

Jump to:navigation, search

The following pages link to (Q2834459):

Displayed 9 items.

Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040) ‎ (← links)
Batch policy learning in average reward Markov decision processes (Q2112817) ‎ (← links)
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) ‎ (← links)
Learning When-to-Treat Policies (Q5857115) ‎ (← links)
Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (Q5857153) ‎ (← links)
A mathematical perspective of machine learning (Q6118171) ‎ (← links)
A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets (Q6138596) ‎ (← links)
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning (Q6154019) ‎ (← links)
Projected state-action balancing weights for offline reinforcement learning (Q6183753) ‎ (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q2834459"