A survey of algorithmic methods for partially observed Markov decision processes

From MaRDI portal

Publication:2638960

Jump to:navigation, search

DOI10.1007/BF02055574zbMath0717.90086MaRDI QIDQ2638960

William S. Lovejoy

Publication date: 1991

Published in: Annals of Operations Research (Search for Journal in Brave)

zbMATH Keywords

survey incomplete information partially observed Markov decision process finite and infinite horizons approximation methodologies

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02) Computational methods for problems pertaining to operations research and mathematical programming (90-08)

Related Items

A two-state partially observable Markov decision process with three actions, Dynamic Pricing and Learning with Finite Inventories, Model Checking Linear-Time Properties of Probabilistic Systems, Admission Control Policies in a Finite Capacity Geo/Geo/1 Queue Under Partial State Observations, Planning for multiple measurement channels in a continuous-state POMDP, An efficient heuristic for a partially observable Markov decision process of machine replacement, Control limits for two-state partially observable Markov decision processes, Asymptotically optimal Bayesian sequential change detection and identification rules, A unified model of qualitative belief change: a dynamical systems perspective, Cops and invisible robbers: the cost of drunkenness, Dynamic Learning and Decision Making via Basis Weight Vectors, BOUNDED-PARAMETER PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES: FRAMEWORK AND ALGORITHM, Abstraction and approximate decision-theoretic planning., Finding optimal memoryless policies of POMDPs under the expected average reward criterion, Affect control processes: intelligent affective interaction using a partially observable Markov decision process, Computation of approximate optimal policies in a partially observed inventory model with rain checks, A unified framework for stochastic optimization, SLAP: specification logic of actions with probability, State observation accuracy and finite-memory policy performance, Markov limid processes for representing and solving renewal problems, Optimal Online Learning for Nonlinear Belief Models Using Discrete Priors, An Approximation Approach for Response-Adaptive Clinical Trial Design, An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes, A survey of decision making and optimization under uncertainty, Unnamed Item, Markov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information), Modelling of hydrological persistence for hidden state Markov decision processes, Planning and acting in partially observable stochastic domains, Performance prediction of an unmanned airborne vehicle multi-agent system, Heuristic anytime approaches to stochastic decision processes, Partially observable Markov decision process approximations for adaptive sensing, Probabilistic Acceptors for Languages over Infinite Words, Optimal condition based maintenance with imperfect information and the proportional hazards model, Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy, A simple suboptimal algorithm for system maintance under partial observability, Selecting a quality control attribute sample: An information-economics method, Partially observable Markov decision processes with imprecise parameters, A Fenchel-Moreau-Rockafellar type theorem on the Kantorovich-Wasserstein space with applications in partially observable Markov decision processes, Predictive control of discrete time stochastic nonlinear state space dynamical systems: a particle nonparametric approach, Influence of modeling structure in probabilistic sequential decision problems, A tutorial on partially observable Markov decision processes, Optimizing active surveillance for prostate cancer using partially observable Markov decision processes, Stochastic dynamic programming with factored representations, Stratified breast cancer follow-up using a continuous state partially observable Markov decision process, A simulation-based approach to stochastic dynamic programming, Optimal decisions in stochastic graphs with uncorrelated and correlated edge weights, A leader-follower partially observed, multiobjective Markov game, Value of information for a leader-follower partially observed Markov game, Optimal sensor scheduling for hidden Markov model state estimation

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2638960&oldid=15442070"