A survey of multi-objective sequential decision-making

DOI10.1613/JAIR.3987MaRDI QIDQ2856473zbMATH OpenOpenAlexFDO

Authors Diedrik M. Roijers, P. Vamplew, Shimon Whiteson, Richard Dazeley

Publication date 29 October 2013

Published in The Journal of Artificial Intelligence Research (JAIR) (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1402.0590

Learning and adaptive systems in artificial intelligence (68T05) Research exposition (monographs, survey articles) pertaining to computer science (68-02) Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20) Decision theory (91B06) Reasoning under uncertainty in the context of artificial intelligence (68T37)

Abstract: Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.

Recommendations

Cited in

(25)

This page was built for publication: A survey of multi-objective sequential decision-making

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2856473)