Bayesian dynamic programming

From MaRDI portal
Publication:4077765

DOI10.2307/1426080zbMath0316.90081OpenAlexW2327833721MaRDI QIDQ4077765

Ulrich Rieder

Publication date: 1975

Published in: Advances in Applied Probability (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.2307/1426080




Related Items

Asymptotic optimality of tracking policies in stochastic networks.Semicontinuous nonstationary stochastic gamesOn the generic nonconvergence of Bayesian actions and beliefsSufficient conditions for optimality of a \((z,c^ -,c^ +)\)-sampling plan in multistage Bayesian acceptance samplingOptimal learning with costly adjustmentMarkov games with incomplete informationMarkov Decision Processes with Incomplete Information and Semiuniform Feller Transition ProbabilitiesNonparametric Adaptive Robust Control under Model UncertaintyData-driven nonparametric robust control under dependence uncertaintyOn theory and algorithms for Markov decision problems with the total reward criterionA Bayesian approach to the triage problem with imperfect classificationA natural extension of the MacQueen extrapolationConditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimalEstimation and control in multichain processesStructured policies in the sequential design of experimentsA general storage model with applications to energy systemsConvergence of probability measures and Markov decision models with incomplete informationCash management in a randomly varying environmentOn dynamic programming: Compactness of the space of policiesOn the optimality of (z, Z)-order-policies in adaptive inventory controlSome results on analytic spaces and semi-analytic functions with regard to gambling theoryOn a representation of measurable automaton transformations by stochastic automataGood news and bad news in two-armed banditsDynamic risk measures under model uncertaintyCustomer Scheduling with Incomplete InformationPartially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition ProbabilitiesNumerical aspects in Bayesian inventory controlOn piecewise deterministic Markov control processes: Control of jumps and of risk processes in insuranceVariance Regularization in Sequential Bayesian OptimizationGeneralized Bandit ProblemsAdaptive Policies in Markov Decision Processes with Uncertain Transition MatricesOn granting credit in a random environmentOn the improvement of allocation rules for multi-armed bandit problem