Pages that link to "Item:Q5704236"

From MaRDI portal

← On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies (Q5704236)

Jump to:navigation, search

The following pages link to On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies (Q5704236):

Displaying 6 items.

Stationary anonymous sequential games with undiscounted rewards (Q493048) (← links)
Simulation-based optimization of Markov decision processes: an empirical process theory approach (Q608432) (← links)
Fast convergence to state-action frequency polytopes for MDPs (Q1015315) (← links)
Acceptable strategy profiles in stochastic games (Q1651302) (← links)
NP-hardness of checking the unichain condition in average cost MDPs (Q2467470) (← links)
Fluctuation Bounds for the Max-Weight Policy with Applications to State Space Collapse (Q5126318) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere"