scientific article
From MaRDI portal
Publication:3093188
zbMath1222.68256MaRDI QIDQ3093188
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v5/mannor04a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Multi-objective and goal programming (90C29) Learning and adaptive systems in artificial intelligence (68T05) Other game-theoretic models (91A40) Stochastic games, stochastic differential games (91A15)
Related Items
Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search, Efficient multi-objective neural architecture search framework via policy gradient algorithm, Preference-based reinforcement learning: a formal framework and a policy iteration algorithm, Approachability in Stackelberg stochastic games with vector costs, Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets, An actor-critic algorithm for constrained Markov decision processes
Uses Software