State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms

From MaRDI portal

Publication:3947462

Jump to:navigation, search

DOI10.1287/mnsc.28.1.1zbMath0486.90084OpenAlexW2123651102MaRDI QIDQ3947462

George E. Monahan

Publication date: 1982

Published in: Management Science (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/mnsc.28.1.1

zbMATH Keywords

survey algorithms learning optimal stopping machine maintenance quality control partially observable Markov decision processes finite action space finite state space state information acquisition computation of optimal solutions internal auditing

Mathematics Subject Classification ID

Applications of statistics in engineering and industry; control charts (62P30) Discrete-time Markov processes on general state spaces (60J05) Reliability, availability, maintenance, inspection in operations research (90B25) Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02) Applications of Markov renewal processes (reliability, queueing networks, etc.) (60K20)

Related Items (86)

Optimal sequential file search ⋮ Ambiguous partially observable Markov decision processes: structural results and applications ⋮ Robust set-point regulation for ecological models with multiple management goals ⋮ PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES AND PERIODIC POLICIES WITH APPLICATIONS ⋮ An optimal inspection and replacement policy under incomplete state information ⋮ Partially observable Markov decision processes incorporating ⋮ Stochastic allocation of inspection capacity to competitive processes ⋮ Finite Horizon Decision Timing with Partially Observable Poisson Processes ⋮ Experimental Design for Partially Observed Markov Decision Processes ⋮ Structural results for partially observed control models ⋮ A two-state partially observable Markov decision process with three actions ⋮ On the construction of \(\epsilon\)-optimal strategies in partially observed MDPs ⋮ A survey of algorithmic methods for partially observed Markov decision processes ⋮ On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes ⋮ On model order estimation for partially observed Markov chains ⋮ Dynamic Pricing and Learning with Finite Inventories ⋮ Model Checking Linear-Time Properties of Probabilistic Systems ⋮ Planning for multiple measurement channels in a continuous-state POMDP ⋮ The value of observing the condition of a deteriorating machine ⋮ An efficient heuristic for a partially observable Markov decision process of machine replacement ⋮ A decentralized partially observable Markov decision model with action duration for goal recognition in real time strategy games ⋮ Nonparametric adaptive control of discrete-time partially observable stochastic systems ⋮ Visual transfer for reinforcement learning via gradient penalty based Wasserstein domain confusion ⋮ Control limits for two-state partially observable Markov decision processes ⋮ Training and repair policies for stand-by systems ⋮ Cops and invisible robbers: the cost of drunkenness ⋮ Dynamic Learning and Decision Making via Basis Weight Vectors ⋮ Piecewise Linear Approximations for Partially Observable Markov Decision Processes with Finite Horizons ⋮ An integrated data-driven method using deep learning for a newsvendor problem with unobservable features ⋮ Markov Decision Processes with Incomplete Information and Semiuniform Feller Transition Probabilities ⋮ A nonlinear programming model for partially observable Markov decision processes: Finite horizon case ⋮ BOUNDED-PARAMETER PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES: FRAMEWORK AND ALGORITHM ⋮ Future memories are not needed for large classes of POMDPs ⋮ Off-policy evaluation in partially observed Markov decision processes under sequential ignorability ⋮ Computation of approximate optimal policies in a partially observed inventory model with rain checks ⋮ Stochastic switching for partially observable dynamics and optimal asset allocation ⋮ SLAP: specification logic of actions with probability ⋮ Sequential process control under capacity constraints. ⋮ State observation accuracy and finite-memory policy performance ⋮ Group Maintenance: A Restless Bandits Approach ⋮ A review of operations research models in invasive species management: state of the art, challenges, and future directions ⋮ Nonstationary Bandits with Habituation and Recovery Dynamics ⋮ An Approximation Approach for Response-Adaptive Clinical Trial Design ⋮ A Multi-Stage Two-Machines Replacement Strategy Using Mixture Models, Bayesian Inference, and Stochastic Dynamic Programming ⋮ Optimal investment management for a defined contribution pension fund under imperfect information ⋮ An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes ⋮ On an Approach to Evaluation of Health Care Programme by Markov Decision Model ⋮ Markov decision processes ⋮ A survey of decision making and optimization under uncertainty ⋮ Active Inference, Belief Propagation, and the Bethe Approximation ⋮ Markov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information) ⋮ Multivariate Bayesian process control for a finite production run ⋮ Planning and acting in partially observable stochastic domains ⋮ Performance prediction of an unmanned airborne vehicle multi-agent system ⋮ Heuristic anytime approaches to stochastic decision processes ⋮ Optimal policies for inventory systems with finite capacity and partially observed Markov-modulated demand and supply processes ⋮ Probabilistic Acceptors for Languages over Infinite Words ⋮ Optimal condition based maintenance with imperfect information and the proportional hazards model ⋮ Optimally maintaining a multi-state system with limited imperfect preventive repairs ⋮ A Bayesian learning model for estimating unknown demand parameter in revenue management ⋮ Control strategy of speed servo systems based on deep reinforcement learning ⋮ Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion ⋮ Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy ⋮ A superharmonic approach to solving infinite horizon partially observable Markov decision problems ⋮ A simple suboptimal algorithm for system maintance under partial observability ⋮ On replacement policies for additive systems with several working levels ⋮ Monitoring machine operations using on-line sensors ⋮ Optimal policies under risk for changing software systems based on customer satisfaction ⋮ A tutorial on partially observable Markov decision processes ⋮ Learning, risk attitude and hot stoves in restless bandit problems ⋮ Stratified breast cancer follow-up using a continuous state partially observable Markov decision process ⋮ Blackwell optimality in Markov decision processes with partial observation. ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms ⋮ Integral control for population management ⋮ Limiting distributions of functionals of Markov chains ⋮ Stochastic control theory and operational research ⋮ Portfolio selection with imperfect information: A hidden Markov model ⋮ A survey of solution techniques for the partially observed Markov decision process ⋮ Optimal cost and policy for a Markovian replacement problem ⋮ Adaptive control of Markov processes with incomplete state information and unknown parameters ⋮ Extension of the Frank-Wolfe algorithm to concave nondifferentiable objective functions ⋮ A leader-follower partially observed, multiobjective Markov game ⋮ Value of information for a leader-follower partially observed Markov game ⋮ On the undecidability of probabilistic planning and related stochastic optimization problems ⋮ Equivalence notions and model minimization in Markov decision processes ⋮ Optimal sensor scheduling for hidden Markov model state estimation

This page was built for publication: State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3947462&oldid=17639038"