Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets

From MaRDI portal

Publication:5145843

Jump to:navigation, search

DOI10.1613/jair.1.12270zbMath1497.68414MaRDI QIDQ5145843

Yongcan Cao, Huixin Zhan

Publication date: 22 January 2021

Published in: Journal of Artificial Intelligence Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1613/jair.1.12270

zbMATH Keywords

Markov decision processes; reinforcement learning

Mathematics Subject Classification ID

90C29: Multi-objective and goal programming

68T05: Learning and adaptive systems in artificial intelligence

90C40: Markov and semi-Markov decision processes

Uses Software

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5145843&oldid=19685462"