Convergence of Markov decision processes with constraints and state-action dependent discount factors

Publication:2301208

DOI: 10.1007/s11425-017-9292-1; zbMath: 1433.90185; OpenAlex: W2915384137; MaRDI QID: Q2301208

Xiao Wu, Xianping Guo

Publication date: 28 February 2020

Published in: Science China. Mathematics

Full work available at URL: https://doi.org/10.1007/s11425-017-9292-1





