Unbounded dynamic programming via the Q-transform

From MaRDI portal
Publication:2138381

DOI10.1016/J.JMATECO.2022.102652zbMATH Open1490.91073arXiv2012.00219OpenAlexW4211248567MaRDI QIDQ2138381FDOQ2138381


Authors: Qingyin Ma, John Stachurski, Alexis Akira Toda Edit this on Wikidata


Publication date: 11 May 2022

Published in: Journal of Mathematical Economics (Search for Journal in Brave)

Abstract: We propose a new approach to solving dynamic decision problems with unbounded rewards based on the transformations used in Q-learning. In our case, the objective of the transform is to convert an unbounded dynamic program into a bounded one. The approach is general enough to handle problems for which existing methods struggle, and yet simple relative to other techniques and accessible for applied work. We show by example that many common decision problems satisfy our conditions.


Full work available at URL: https://arxiv.org/abs/2012.00219




Recommendations




Cites Work


Cited In (5)





This page was built for publication: Unbounded dynamic programming via the Q-transform

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2138381)