Pages that link to "Item:Q5704236"
From MaRDI portal
The following pages link to On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies (Q5704236):
Displaying 6 items.
- Stationary anonymous sequential games with undiscounted rewards (Q493048) (← links)
- Simulation-based optimization of Markov decision processes: an empirical process theory approach (Q608432) (← links)
- Fast convergence to state-action frequency polytopes for MDPs (Q1015315) (← links)
- Acceptable strategy profiles in stochastic games (Q1651302) (← links)
- NP-hardness of checking the unichain condition in average cost MDPs (Q2467470) (← links)
- Fluctuation Bounds for the Max-Weight Policy with Applications to State Space Collapse (Q5126318) (← links)