Stochastic dynamic programming with non-linear discounting

DOI10.1007/S00245-020-09731-XMaRDI QIDQ2234309zbMATH OpenOpenAlexFDO

Authors Nicole Bäuerle, Anna Jaskiewicz, Andrzej S. Nowak

Publication date 19 October 2021

Published in Applied Mathematics and Optimization (Search for Journal in Brave)

Copyright license Creative Commons Attribution 4.0 International

Full work available at URL https://arxiv.org/abs/2011.02239

zbMATH Keywords

Bellman equation stochastic dynamic programming optimal stationary policy nonlinear discounting

Mathematics Subject Classification ID

Dynamic programming (90C39) Resource and cost allocation (including fair division, apportionment, etc.) (91B32) Discrete-time Markov processes on general state spaces (60J05) Markov and semi-Markov decision processes (90C40) Economic growth models (91B62)

Abstract: In this paper, we study a Markov decision process with a non-linear discount function and with a Borel state space. We define a recursive discounted utility, which resembles non-additive utility functions considered in a number of models in economics. Non-additivity here follows from non-linearity of the discount function. Our study is complementary to the work of Ja'skiewicz, Matkowski and Nowak (Math. Oper. Res. 38 (2013), 108-121), where also non-linear discounting is used in the stochastic setting, but the expectation of utilities aggregated on the space of all histories of the process is applied leading to a non-stationary dynamic programming model. Our aim is to prove that in the recursive discounted utility case the Bellman equation has a solution and there exists an optimal stationary policy for the problem in the infinite time horizon. Our approach includes two cases:

(a)

when the one-stage utility is bounded on both sides by a weight function multiplied by some positive and negative constants, and

(b)

when the one-stage utility is unbounded from below.

Recommendations

Cites work

Cited in

(11)

This page was built for publication: Stochastic dynamic programming with non-linear discounting

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2234309)