Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets

From MaRDI portal
Publication:5145843


DOI10.1613/jair.1.12270zbMath1497.68414MaRDI QIDQ5145843

Yongcan Cao, Huixin Zhan

Publication date: 22 January 2021

Published in: Journal of Artificial Intelligence Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1613/jair.1.12270


90C29: Multi-objective and goal programming

68T05: Learning and adaptive systems in artificial intelligence

90C40: Markov and semi-Markov decision processes



Uses Software


Cites Work