R-MAX
From MaRDI portal
Software:15078
swMATH2539MaRDI QIDQ15078FDOQ15078
Author name not available (Why is that?)
Cited In (33)
- An analysis of model-based interval estimation for Markov decision processes
- Reinforcement learning in finite MDPs: PAC analysis
- Near-optimal regret bounds for reinforcement learning
- Belief and truth in hypothesised behaviours
- Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning
- Model selection in reinforcement learning
- Adaptive representations for reinforcement learning.
- Uncertainty Propagation for Efficient Exploration in Reinforcement Learning
- Title not available (Why is that?)
- A Monte-Carlo AIXI approximation
- Learning Theory
- Multi-agent reinforcement learning in common interest and fixed sum stochastic games: an experimental study
- Reducing reinforcement learning to KWIK online regression
- Title not available (Why is that?)
- Perspectives on multiagent learning
- Guiding exploration by pre-existing knowledge without modifying reward
- On the possibility of learning in reactive environments with arbitrary dependence
- A minimum relative entropy principle for learning and acting
- Algorithms for reinforcement learning.
- Reinforcement learning agents
- Provably efficient learning with typed parametric models
- Title not available (Why is that?)
- Machine Learning: ECML 2004
- Multi-agent reinforcement learning: a selective overview of theories and algorithms
- Title not available (Why is that?)
- Title not available (Why is that?)
- Markov decision processes with arbitrary reward processes
- Cooperative learning with joint state value approximation for multi-agent systems
- Knows what it knows: a framework for self-aware learning
- Bounded Parameter Markov Decision Processes with Average Reward Criterion
- If multi-agent learning is the answer, what is the question?
- Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence
- Efficient learning equilibrium
This page was built for software: R-MAX