Pages that link to "Item:Q3116079"
From MaRDI portal
The following pages link to Bias and Variance Approximation in Value Function Estimates (Q3116079):
Displayed 15 items.
- The optimal unbiased value estimator and its relation to LSTD, TD and MC (Q415609) (← links)
- Spectral and graph-theoretic bounds on steady-state-probability estimation performance for an ergodic Markov chain (Q430187) (← links)
- Policy-based branch-and-bound for infinite-horizon multi-model Markov decision processes (Q2026970) (← links)
- An active-set strategy to solve Markov decision processes with good-deal risk measure (Q2329646) (← links)
- Constrained Markov decision processes with uncertain costs (Q2670508) (← links)
- Reinforcement Learning in Robust Markov Decision Processes (Q2833106) (← links)
- Robust MDPs with <i>k</i>-Rectangular Uncertainty (Q2833114) (← links)
- Towards Min Max Generalization in Reinforcement Learning (Q3006026) (← links)
- (Q4636981) (← links)
- Deterministic policies based on maximum regrets in MDPs with imprecise rewards (Q5069649) (← links)
- (Q5148951) (← links)
- Distributionally robust optimization for sequential decision-making (Q5238202) (← links)
- Joint chance-constrained Markov decision processes (Q6160959) (← links)
- Compromise policy for multi-stage stochastic linear programming: variance and bias reduction (Q6164357) (← links)
- Off-policy evaluation for tabular reinforcement learning with synthetic trajectories (Q6190662) (← links)