Publication:4034339
From MaRDI portal
zbMath0795.90082MaRDI QIDQ4034339
Publication date: 16 May 1993
method of successive approximationsstationary policypolicy improvementfirst passage modelone-step reward
Related Items
Constrained Markov decision processes with first passage criteria, First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors, First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs, Finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors, First passage risk probability optimality for continuous time Markov decision processes