State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms

From MaRDI portal
Publication:3947462

DOI10.1287/mnsc.28.1.1zbMath0486.90084OpenAlexW2123651102MaRDI QIDQ3947462

George E. Monahan

Publication date: 1982

Published in: Management Science (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/mnsc.28.1.1




Related Items (86)

Optimal sequential file searchAmbiguous partially observable Markov decision processes: structural results and applicationsRobust set-point regulation for ecological models with multiple management goalsPARTIALLY OBSERVABLE MARKOV DECISION PROCESSES AND PERIODIC POLICIES WITH APPLICATIONSAn optimal inspection and replacement policy under incomplete state informationPartially observable Markov decision processes incorporatingStochastic allocation of inspection capacity to competitive processesFinite Horizon Decision Timing with Partially Observable Poisson ProcessesExperimental Design for Partially Observed Markov Decision ProcessesStructural results for partially observed control modelsA two-state partially observable Markov decision process with three actionsOn the construction of \(\epsilon\)-optimal strategies in partially observed MDPsA survey of algorithmic methods for partially observed Markov decision processesOn the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processesOn model order estimation for partially observed Markov chainsDynamic Pricing and Learning with Finite InventoriesModel Checking Linear-Time Properties of Probabilistic SystemsPlanning for multiple measurement channels in a continuous-state POMDPThe value of observing the condition of a deteriorating machineAn efficient heuristic for a partially observable Markov decision process of machine replacementA decentralized partially observable Markov decision model with action duration for goal recognition in real time strategy gamesNonparametric adaptive control of discrete-time partially observable stochastic systemsVisual transfer for reinforcement learning via gradient penalty based Wasserstein domain confusionControl limits for two-state partially observable Markov decision processesTraining and repair policies for stand-by systemsCops and invisible robbers: the cost of drunkennessDynamic Learning and Decision Making via Basis Weight VectorsPiecewise Linear Approximations for Partially Observable Markov Decision Processes with Finite HorizonsAn integrated data-driven method using deep learning for a newsvendor problem with unobservable featuresMarkov Decision Processes with Incomplete Information and Semiuniform Feller Transition ProbabilitiesA nonlinear programming model for partially observable Markov decision processes: Finite horizon caseBOUNDED-PARAMETER PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES: FRAMEWORK AND ALGORITHMFuture memories are not needed for large classes of POMDPsOff-policy evaluation in partially observed Markov decision processes under sequential ignorabilityComputation of approximate optimal policies in a partially observed inventory model with rain checksStochastic switching for partially observable dynamics and optimal asset allocationSLAP: specification logic of actions with probabilitySequential process control under capacity constraints.State observation accuracy and finite-memory policy performanceGroup Maintenance: A Restless Bandits ApproachA review of operations research models in invasive species management: state of the art, challenges, and future directionsNonstationary Bandits with Habituation and Recovery DynamicsAn Approximation Approach for Response-Adaptive Clinical Trial DesignA Multi-Stage Two-Machines Replacement Strategy Using Mixture Models, Bayesian Inference, and Stochastic Dynamic ProgrammingOptimal investment management for a defined contribution pension fund under imperfect informationAn integrated approach to solving influence diagrams and finite-horizon partially observable decision processesOn an Approach to Evaluation of Health Care Programme by Markov Decision ModelMarkov decision processesA survey of decision making and optimization under uncertaintyActive Inference, Belief Propagation, and the Bethe ApproximationMarkov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information)Multivariate Bayesian process control for a finite production runPlanning and acting in partially observable stochastic domainsPerformance prediction of an unmanned airborne vehicle multi-agent systemHeuristic anytime approaches to stochastic decision processesOptimal policies for inventory systems with finite capacity and partially observed Markov-modulated demand and supply processesProbabilistic Acceptors for Languages over Infinite WordsOptimal condition based maintenance with imperfect information and the proportional hazards modelOptimally maintaining a multi-state system with limited imperfect preventive repairsA Bayesian learning model for estimating unknown demand parameter in revenue managementControl strategy of speed servo systems based on deep reinforcement learningSuccessive approximations in partially observable controlled Markov chains with risk-sensitive average criterionUndiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policyA superharmonic approach to solving infinite horizon partially observable Markov decision problemsA simple suboptimal algorithm for system maintance under partial observabilityOn replacement policies for additive systems with several working levelsMonitoring machine operations using on-line sensorsOptimal policies under risk for changing software systems based on customer satisfactionA tutorial on partially observable Markov decision processesLearning, risk attitude and hot stoves in restless bandit problemsStratified breast cancer follow-up using a continuous state partially observable Markov decision processBlackwell optimality in Markov decision processes with partial observation.Multi-agent reinforcement learning: a selective overview of theories and algorithmsIntegral control for population managementLimiting distributions of functionals of Markov chainsStochastic control theory and operational researchPortfolio selection with imperfect information: A hidden Markov modelA survey of solution techniques for the partially observed Markov decision processOptimal cost and policy for a Markovian replacement problemAdaptive control of Markov processes with incomplete state information and unknown parametersExtension of the Frank-Wolfe algorithm to concave nondifferentiable objective functionsA leader-follower partially observed, multiobjective Markov gameValue of information for a leader-follower partially observed Markov gameOn the undecidability of probabilistic planning and related stochastic optimization problemsEquivalence notions and model minimization in Markov decision processesOptimal sensor scheduling for hidden Markov model state estimation




This page was built for publication: State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms