The following pages link to (Q2880979):
Displayed 14 items.
- Extreme state aggregation beyond Markov decision processes (Q329613) (← links)
- Hybrid answer set programming (Q392277) (← links)
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model (Q399890) (← links)
- Knows what it knows: a framework for self-aware learning (Q413843) (← links)
- Near-optimal PAC bounds for discounted MDPs (Q465258) (← links)
- Reducing reinforcement learning to KWIK online regression (Q616761) (← links)
- Solving for Best Responses and Equilibria in Extensive-Form Games with Reinforcement Learning Methods (Q3299845) (← links)
- Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes (Q4607932) (← links)
- (Q4998915) (← links)
- (Q5053310) (← links)
- (Q5149240) (← links)
- (Q5214220) (← links)
- Identity concealment games: how I learned to stop revealing and love the coincidences (Q6119741) (← links)
- Recent advances in reinforcement learning in finance (Q6146668) (← links)