Pages that link to "Item:Q3768706"

From MaRDI portal

← Learning algorithms for Markov decision processes (Q3768706)

Jump to:navigation, search

The following pages link to Learning algorithms for Markov decision processes (Q3768706):

Displayed 7 items.

Adaptive control of Markov chains with local updates (Q913734) ‎ (← links)
A unified approach to adaptive control of average reward Markov decision processes (Q1095048) ‎ (← links)
Computationally efficient algorithms for on-line optimization of Markov decision processes (Q1190506) ‎ (← links)
Statistical inference for a finite optimal stopping problem with unknown transition probabilities (Q1423869) ‎ (← links)
Central limit theorem for the estimator of the value of an optimal stopping problem (Q2387145) ‎ (← links)
(Q3971601) ‎ (← links)
Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes (Q3984139) ‎ (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q3768706"