Publication:3093188
From MaRDI portal
zbMath1222.68256MaRDI QIDQ3093188
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v5/mannor04a.html
90C29: Multi-objective and goal programming
68T05: Learning and adaptive systems in artificial intelligence
91A40: Other game-theoretic models
91A15: Stochastic games, stochastic differential games
Related Items
Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets, Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search, Approachability in Stackelberg stochastic games with vector costs, Preference-based reinforcement learning: a formal framework and a policy iteration algorithm, An actor-critic algorithm for constrained Markov decision processes
Uses Software