Pages that link to "Item:Q5307594"

From MaRDI portal

The following pages link to Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path (Q5307594):

Displayed 5 items.

Approximation of Markov decision processes with general state space (Q663675)
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path (Q1009248)
An incremental off-policy search in a model-free Markov decision process using a single sample path (Q1621868)
Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040)
A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning (Q2633537)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q5307594"