The Optimal Control of Partially Observable Markov Processes over a Finite Horizon

From MaRDI portal

Publication:4401309

Jump to:navigation, search

DOI10.1287/opre.21.5.1071zbMath0275.93059OpenAlexW2034725503WikidataQ29541847 ScholiaQ29541847MaRDI QIDQ4401309

Richard D. Smallwood, Edward J. Sondik

Publication date: 1973

Published in: Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/opre.21.5.1071

Mathematics Subject Classification ID

Continuous-time Markov processes on general state spaces (60J25) Optimal stochastic control (93E20)

Related Items

Optimal Control of Partially Observable Semi-Markovian Failing Systems: An Analysis Using a Phase Methodology, Policy structure for discrete time Markov chain disorder problems, Goal-directed learning of features and forward models, Ambiguous partially observable Markov decision processes: structural results and applications, On arrival driven queueing models: Admission control, traffic policing, abandonments, and correlated arrivals, Finite Horizon Decision Timing with Partially Observable Poisson Processes, A two-state partially observable Markov decision process with three actions, On the construction of \(\epsilon\)-optimal strategies in partially observed MDPs, A survey of algorithmic methods for partially observed Markov decision processes, On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes, Multistate Bayesian Control Chart Over a Finite Horizon, Availability maximization under partial observations, Admission Control Policies in a Finite Capacity Geo/Geo/1 Queue Under Partial State Observations, Tutorial series on brain-inspired computing. IV: Reinforcement learning: machine learning and natural learning, Planning for multiple measurement channels in a continuous-state POMDP, The value of observing the condition of a deteriorating machine, An efficient heuristic for a partially observable Markov decision process of machine replacement, A unified model of qualitative belief change: a dynamical systems perspective, The skyline algorithm for POMDP value function pruning, Dynamic Learning and Decision Making via Basis Weight Vectors, Piecewise Linear Approximations for Partially Observable Markov Decision Processes with Finite Horizons, Markov Decision Processes with Incomplete Information and Semiuniform Feller Transition Probabilities, A nonlinear programming model for partially observable Markov decision processes: Finite horizon case, BOUNDED-PARAMETER PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES: FRAMEWORK AND ALGORITHM, Semi-uniform Feller stochastic kernels, Inventory control with modulated demand and a partially observed modulation process, Abstraction and approximate decision-theoretic planning., Future memories are not needed for large classes of POMDPs, Finding optimal memoryless policies of POMDPs under the expected average reward criterion, Reconstructing the hidden states in time course data of stochastic models, Off-policy evaluation in partially observed Markov decision processes under sequential ignorability, Control limit policies for early detection of failure, Computation of approximate optimal policies in a partially observed inventory model with rain checks, Dynamic Selling Mechanisms for Product Differentiation and Learning, Decentralized MDPs with sparse interactions, A unified framework for stochastic optimization, The role of information in system stability with partially observable servers, Control: a perspective, Semi-Markovian decision models with vector-valued reward, Distributionally Robust Partially Observable Markov Decision Process with Moment-Based Ambiguity, An Approximation Approach for Response-Adaptive Clinical Trial Design, Convergence of probability measures and Markov decision models with incomplete information, An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes, Using POMDPs for learning cost sensitive decision trees, Managing mobile production-inventory systems influenced by a modulation process, Unnamed Item, Markov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information), Multivariate Bayesian process control for a finite production run, A fast approximation method for partially observable Markov decision processes, Planning and acting in partially observable stochastic domains, Partially observable Markov decision process approximations for adaptive sensing, Optimal policies for inventory systems with finite capacity and partially observed Markov-modulated demand and supply processes, Timing of testing and treatment for asymptomatic diseases, Finite-state, discrete-time optimization with randomly varying observation quality, Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation, Optimal condition based maintenance with imperfect information and the proportional hazards model, Timely Decision Analysis Enabled by Efficient Social Media Modeling, Recursively modeling other agents for decision making: a research perspective, Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy, Inverse optimization for assessing emerging technologies in breast cancer screening, Knowledge-based programs as succinct policies for partially observable domains, Optimal control-limit strategies for a partially observed replacement problem†, Reasoning about uncertain parameters and agent behaviors through encoded experiences and belief planning, Informed production optimization in hydrocarbon reservoirs, On replacement policies for additive systems with several working levels, A Fenchel-Moreau-Rockafellar type theorem on the Kantorovich-Wasserstein space with applications in partially observable Markov decision processes, Monitoring machine operations using on-line sensors, Patient-Type Bayes-Adaptive Treatment Plans, A tutorial on partially observable Markov decision processes, Optimizing active surveillance for prostate cancer using partially observable Markov decision processes, Stochastic dynamic programming with factored representations, A New Computational Approach to Cost Variance Iuvestigation Problems, Stratified breast cancer follow-up using a continuous state partially observable Markov decision process, Transformation of partially observable Markov decision processes into piecewise linear ones, Portfolio selection with imperfect information: A hidden Markov model, A survey of solution techniques for the partially observed Markov decision process, A leader-follower partially observed, multiobjective Markov game, Value of information for a leader-follower partially observed Markov game, On the undecidability of probabilistic planning and related stochastic optimization problems, Optimal adaptive control policy for joint machine maintenance and product quality control, Optimal sensor scheduling for hidden Markov model state estimation

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4401309&oldid=18422632"