Bayesian dynamic programming
From MaRDI portal
Publication:4077765
DOI10.2307/1426080zbMath0316.90081OpenAlexW2327833721MaRDI QIDQ4077765
Publication date: 1975
Published in: Advances in Applied Probability (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.2307/1426080
Bayesian problems; characterization of Bayes procedures (62C10) Adaptive control/observation systems (93C40) Markov and semi-Markov decision processes (90C40)
Related Items
Asymptotic optimality of tracking policies in stochastic networks. ⋮ Semicontinuous nonstationary stochastic games ⋮ On the generic nonconvergence of Bayesian actions and beliefs ⋮ Sufficient conditions for optimality of a \((z,c^ -,c^ +)\)-sampling plan in multistage Bayesian acceptance sampling ⋮ Optimal learning with costly adjustment ⋮ Markov games with incomplete information ⋮ Markov Decision Processes with Incomplete Information and Semiuniform Feller Transition Probabilities ⋮ Nonparametric Adaptive Robust Control under Model Uncertainty ⋮ Data-driven nonparametric robust control under dependence uncertainty ⋮ On theory and algorithms for Markov decision problems with the total reward criterion ⋮ A Bayesian approach to the triage problem with imperfect classification ⋮ A natural extension of the MacQueen extrapolation ⋮ Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal ⋮ Estimation and control in multichain processes ⋮ Structured policies in the sequential design of experiments ⋮ A general storage model with applications to energy systems ⋮ Convergence of probability measures and Markov decision models with incomplete information ⋮ Cash management in a randomly varying environment ⋮ On dynamic programming: Compactness of the space of policies ⋮ On the optimality of (z, Z)-order-policies in adaptive inventory control ⋮ Some results on analytic spaces and semi-analytic functions with regard to gambling theory ⋮ On a representation of measurable automaton transformations by stochastic automata ⋮ Good news and bad news in two-armed bandits ⋮ Dynamic risk measures under model uncertainty ⋮ Customer Scheduling with Incomplete Information ⋮ Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities ⋮ Numerical aspects in Bayesian inventory control ⋮ On piecewise deterministic Markov control processes: Control of jumps and of risk processes in insurance ⋮ Variance Regularization in Sequential Bayesian Optimization ⋮ Generalized Bandit Problems ⋮ Adaptive Policies in Markov Decision Processes with Uncertain Transition Matrices ⋮ On granting credit in a random environment ⋮ On the improvement of allocation rules for multi-armed bandit problem