scientific article

From MaRDI portal

Publication:3093188

Jump to:navigation, search

zbMath1222.68256MaRDI QIDQ3093188

Shie Mannor, Nahum Shimkin

Publication date: 12 October 2011

Full work available at URL: http://www.jmlr.org/papers/v5/mannor04a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

Mathematics Subject Classification ID

Multi-objective and goal programming (90C29) Learning and adaptive systems in artificial intelligence (68T05) Other game-theoretic models (91A40) Stochastic games, stochastic differential games (91A15)

Related Items

Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search, Efficient multi-objective neural architecture search framework via policy gradient algorithm, Preference-based reinforcement learning: a formal framework and a policy iteration algorithm, Approachability in Stackelberg stochastic games with vector costs, Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets, An actor-critic algorithm for constrained Markov decision processes

Uses Software

R-MAX

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3093188&oldid=16170660"