A survey of multi-objective sequential decision-making
From MaRDI portal
Publication:2856473
- Learning and adaptive systems in artificial intelligence (68T05)
- Research exposition (monographs, survey articles) pertaining to computer science (68-02)
- Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20)
- Decision theory (91B06)
- Reasoning under uncertainty in the context of artificial intelligence (68T37)
Abstract: Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.
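The distinction the abstract draws between a Pareto front and a convex hull can be illustrated with a small sketch. This is not code from the surveyed article: it assumes a finite set of policies whose values are already known as two-objective vectors, assumes higher is better in each objective, and reads "convex hull" as the set of value vectors that are optimal for some linear scalarization. The names `pareto_front`, `linear_scalarization`, and `convex_hull_members_2d`, and the example vectors, are hypothetical.

```python
from typing import List, Sequence, Tuple

Vector = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is at least as good as b everywhere and strictly better somewhere."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(values: List[Vector]) -> List[Vector]:
    """Keep every value vector that no other vector Pareto-dominates."""
    return [v for v in values if not any(dominates(u, v) for u in values)]


def linear_scalarization(value: Sequence[float], weights: Sequence[float]) -> float:
    """Project a multi-objective value to a scalar with a weighted sum."""
    return sum(w * x for w, x in zip(weights, value))


def convex_hull_members_2d(values: List[Vector], steps: int = 100) -> List[Vector]:
    """Approximate the vectors that are optimal for some linear weighting.

    Assumption: two objectives, with weights (w, 1 - w) swept over a grid.
    A grid sweep is a simplification and can miss vectors that are optimal
    only on a very narrow weight interval.
    """
    winners: List[Vector] = []
    for i in range(steps + 1):
        w = i / steps
        best = max(values, key=lambda v: linear_scalarization(v, (w, 1.0 - w)))
        if best not in winners:
            winners.append(best)
    return winners


# Hypothetical value vectors for four policies.
values = [(4.0, 0.0), (0.0, 4.0), (1.5, 1.5), (1.0, 1.0)]

front = pareto_front(values)
hull = convex_hull_members_2d(values)
print("Pareto front:", front)
print("Optimal under some linear weighting:", sorted(hull))
```

In this example (1.5, 1.5) is Pareto-optimal yet never wins under any weighted sum, since one of the two extreme vectors always scores at least 2.0. That gap is why the type of scalarization function affects which solution set is needed.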
Recommendations
- Some Theory and an Approach to Solving Sequential Multiple-Criteria Decision Problems
- scientific article; zbMATH DE number 3845375
- An approach for simultaneous finding of multiple efficient decisions in multi-objective optimization problems
- A comparison of interactive multiple-objective decision making procedures
- scientific article; zbMATH DE number 1098939
- Solution procedures for multi-objective Markov decision processes
- Multi-objective combinatorial optimization problems: A survey
- Multiple-choice decision making by multicriteria combinatorial optimization
- Multi-attribute sequential decision problem with optimizing and satisficing attributes
Cited in (25)
- Multi-objective Markov decision processes for data-driven decision support
- Multi-objective dynamic programming with limited precision
- Model-based Reinforcement Learning: A Survey
- Efficient multi-objective reinforcement learning via multiple-gradient descent with iteratively discovered weight-vector sets
- FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural Networks
- Submodular optimization problems and greedy strategies: a survey
- Light robustness in the optimization of Markov decision processes with uncertain parameters
- Reinforcement learning
- Multi-objective reinforcement learning through continuous Pareto manifold approximation
- A Gentle Introduction to Reinforcement Learning
- Competence-aware systems
- Computing multiobjective Markov chains handled by the extraproximal method
- A multi-objective approach for PH-graphs with applications to stochastic shortest paths
- Maximisation of admissible multi-objective heuristics
- Simple strategies in multi-objective MDPs
- A multi-objective approach to the cash management problem
- Multi-cost bounded tradeoff analysis in MDP
- Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
- Necessary and sufficient Karush-Kuhn-Tucker conditions for multiobjective Markov chains optimality
- scientific article; zbMATH DE number 7559459
- Constrained multiagent Markov decision processes: a taxonomy of problems and algorithms
- Using the Manhattan distance for computing the multiobjective Markov chains problem
- Reward (Mis)design for autonomous driving
- Multi-objective reinforcement learning using sets of Pareto dominating policies
- Avoiding Negative Side Effects of Autonomous Systems in the Open World