The following pages link to R-MAX (Q15078):
Displayed 33 items.
- Belief and truth in hypothesised behaviours (Q274416) (← links)
- Knows what it knows: a framework for self-aware learning (Q413843) (← links)
- Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning (Q413845) (← links)
- Model selection in reinforcement learning (Q415618) (← links)
- Reducing reinforcement learning to KWIK online regression (Q616761) (← links)
- Efficient learning equilibrium (Q814627) (← links)
- On the possibility of learning in reactive environments with arbitrary dependence (Q950202) (← links)
- An analysis of model-based interval estimation for Markov decision processes (Q959899) (← links)
- Adaptive representations for reinforcement learning. (Q980851) (← links)
- If multi-agent learning is the answer, what is the question? (Q1028919) (← links)
- Perspectives on multiagent learning (Q1028921) (← links)
- Reinforcement learning agents (Q1600659) (← links)
- Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040) (← links)
- Guiding exploration by pre-existing knowledge without modifying reward (Q2383522) (← links)
- (Q2880957) (← links)
- (Q2880979) (← links)
- (Q2896090) (← links)
- Uncertainty Propagation for Efficient Exploration in Reinforcement Learning (Q2999161) (← links)
- (Q3046711) (← links)
- A Monte-Carlo AIXI Approximation (Q3081449) (← links)
- (Q3093188) (← links)
- (Q3096210) (← links)
- Markov Decision Processes with Arbitrary Reward Processes (Q3169064) (← links)
- Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence (Q3522996) (← links)
- A Minimum Relative Entropy Principle for Learning and Acting (Q3588636) (← links)
- Algorithms for Reinforcement Learning (Q3588852) (← links)
- (Q4434166) (← links)
- Learning Theory (Q4680907) (← links)
- (Q4813547) (← links)
- Cooperative learning with joint state value approximation for multi-agent systems (Q4980275) (← links)
- (Q5053336) (← links)
- Bounded Parameter Markov Decision Processes with Average Reward Criterion (Q5434055) (← links)
- Machine Learning: ECML 2004 (Q5450729) (← links)