Lipschitz continuity of value functions in Markovian decision processes
Publication:814878
DOI: 10.1007/s00186-005-0438-1 · zbMath: 1093.90075 · OpenAlex: W2064778949 · MaRDI QID: Q814878
Publication date: 8 February 2006
Published in: Mathematical Methods of Operations Research
Full work available at URL: https://doi.org/10.1007/s00186-005-0438-1
Markov and semi-Markov decision processes (90C40); Nonlinear spectral theory, nonlinear eigenvalue problems (47J10)
Related Items
Lipschitz recursive equilibrium with a minimal state space and heterogeneous agents ⋮ Robustness and sample complexity of model-based MARL for general-sum Markov games ⋮ First-order sensitivity of the optimal value in a Markov decision model with respect to deviations in the transition probability function ⋮ Approximation of Markov decision processes with general state space ⋮ Unnamed Item ⋮ A constructive geometrical approach to the uniqueness of Markov stationary equilibrium in stochastic games of intergenerational altruism ⋮ Generalized envelope theorems: applications to dynamic programming ⋮ Computable approximations for average Markov decision processes in continuous time ⋮ Policy gradient in Lipschitz Markov decision processes ⋮ Unnamed Item ⋮ Stochastic approximations of constrained discounted Markov decision processes ⋮ Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis ⋮ A stability result for linear Markovian stochastic optimization problems