On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes

Publication:2638968

DOI: 10.1007/BF02283610; zbMath: 0717.90094; OpenAlex: W2107338824; Wikidata: Q60167566; Scholia: Q60167566; MaRDI QID: Q2638968

Emmanuel Fernández-Gaucherand, Aristotle Arapostathis, Steven I. Marcus

Publication date: 1991

Published in: Annals of Operations Research

Full work available at URL: https://doi.org/10.1007/bf02283610

zbMATH Keywords

machine replacement; partially observable Markov decision processes; uncountable state space; average cost optimal policies; finite action set; discount optimal policies

Mathematics Subject Classification ID

Reliability, availability, maintenance, inspection in operations research (90B25); Markov chains (discrete-time Markov processes on discrete state spaces) (60J10); Optimal stochastic control (93E20); Markov and semi-Markov decision processes (90C40)

Related Items (6)

A note on the Ross-Taylor theorem ⋮ On the computation of the optimal cost function for discrete time Markov models with partial observations ⋮ Monotonicity properties for two-action partially observable Markov decision processes on partially ordered spaces ⋮ Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes ⋮ OPTIMAL MIXING OF MARKOV DECISION RULES FOR MDP CONTROL ⋮ Long Run Control with Degenerate Observation
