A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion

From MaRDI portal
Publication:5130923

DOI10.1137/19M1255811zbMath1452.90316arXiv1903.08853OpenAlexW2924486842MaRDI QIDQ5130923

Alexandre Genadot, François Dufour

Publication date: 30 October 2020

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1903.08853



Related Items



Cites Work