Discrete Dynamic Programming

From MaRDI portal
Publication:5342868

DOI: 10.1214/aoms/1177704593; zbMath: 0133.12906; OpenAlex: W1986389067; Wikidata: Q110952978; Scholia: Q110952978; MaRDI QID: Q5342868

David Blackwell

Publication date: 1962

Published in: The Annals of Mathematical Statistics

Full work available at URL: https://doi.org/10.1214/aoms/1177704593




Related Items

Late marriage and transition from arranged marriages to love matches: A search-theoretic approach
Continuous time markov decision processes with interventions
OPTIMALITY OF TRUNK RESERVATION FOR AN M/M/K/N QUEUE WITH SEVERAL CUSTOMER TYPES AND HOLDING COSTS
Optimality equations and sensitive optimality in bounded Markov decision processes
BLACKWELL OPTIMAL STRATEGIES IN PRIORITY MEAN-PAYOFF GAMES
An improved algorithm for solving communicating average reward Markov decision processes
On Markovian decision programming with recursive reward functions
On regularly perturbed fundamental matrices
Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
Finite state dynamic programming with the total reward criterion
Finite-Memory Strategies in POMDPs with Long-Run Average Objectives
Sporadic overtaking optimality in Markov decision problems
Some basic concepts of numerical treatment of Markov decision models
Blackwell Optimality for Controlled Diffusion Processes
Entscheidungsmodelle über angeordneten körpern
Sensitivitätsanalysen in entscheidungsmodellen
Blackwell optimal policies in a Markov decision process with a Borel state space
Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory
Strong 0-discount optimal policies in a Markov decision process with a Borel state space
A Markovian decision model of adaptive cancer treatment and quality of life
Approximations for the distribution of perpetuities with small discount rates
An epistemic approach to stochastic games
Four Canadian Contributions to Stochastic Modeling
Does free information provision crowd out costly information acquisition? It's a matter of timing
Solution of a Markovian decision problem by successive overrelaxation
Deterministic discrete dynamic programming with discount factor greater than one: Some further results and algorithms
An axiomatic approach to Markov decision processes
Unnamed Item
STRONG AVERAGE OPTIMALITY FOR CONTROLLED NONHOMOGENEOUS MARKOV CHAINS
Interview with Andrzej Nowak - Laureate of the Rufus Isaacs Award
Another Set of Conditions for Strong n (n = −1, 0) Discount Optimality in Markov Decision Processes
Fuzzy optimality relation for perceptive MDPs-the average case
Unnamed Item
BLACKWELL OPTIMALITY IN STOCHASTIC GAMES
Sample-path optimality and variance-maximization for Markov decision processes
A Policy Improvement Algorithm for Solving a Mixture Class of Perfect Information and AR-AT Semi-Markov Games
A decision exclusion algorithm for a class of Markovian Decision Processes
Semi-infinite semi-Markov stochastic games.
Optimality of intuitive checkpointing policies
Solution procedures for multi-objective markov decision processes
A set of successive approximation methods for discounted Markovian decision problems
Transient policies in discrete dynamic programming: Linear programming including suboptimality tests and additional constraints
An application of Markov potential theory to Markovian decision processes
A Fixed Point Approach to Undiscounted Markov Renewal Programs
Optimality in transient markov chains and linear programming
Blackwell optimality in the class of Markov policies for continuous-time controlled Markov chains
On the chance to visit a goal set infinitely often
Finitely Additive Dynamic Programming
Markov Branching Decision Chains with Interest-Rate-Dependent Rewards
Continuous-Time Markov Decision Processes with Unbounded Transition and Discounted-Reward Rates
Credibilistic Markov decision processes: The average case
Ergodic Control, Bias, and Sensitive Discount Optimality for Markov Diffusion Processes
The optimization of K-effect models by linear and dynamic programming
Finite state continuous time Markov decision processes with an infinite planning horizon
Some remarks on a Markovian decision problem with an absorbing state
Linear programming considerations on Markovian decision processes with no discounting
Strong Uniform Value in Gambling Houses and Partially Observable Markov Decision Processes
Linear programming algorithms for semi-Markovian decision processes
On direct sums of Markovian decision process
On the set of optimal policies in discrete dynamic programming
Optimality of intuitive checkpointing policies
On a set of optimal policies in continuous time Markovian decision problem
Pure Equilibrium Strategies for Stochastic Games via Potential Functions
On zero-sum two-person undiscounted semi-Markov games with a multichain structure
Uniform Tauberian theorem in differential games
A new optimality criterion for discrete dynamic programming
Algorithms for discounted stochastic games
History-dependent Evaluations in Partially Observable Markov Decision Process
A further anticycling rule in multichain policy iteration for undiscounted Markov renewal programs
On the solvability of Bellman's functional equation for a Markovian decision process
Optimal control of stationary Markov processes
Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality
Maximum-Stopping-Value Policies in Finite Markov Population Decision Chains
Unnamed Item
Ordered Field Property for Semi-Markov Games when One Player Controls Transition Probabilities and Transition Times
Commutative Stochastic Games
Optimal Inventory Control and Allocation for Sequential Internet Auctions
MARKOV DECISION PROCESSES
Semi-supervised learning with regularized Laplacian
Index-based policies for discounted multi-armed bandits on parallel machines.
Bilinear programming and structured stochastic games
Optimal inventory control with fixed ordering cost for selling by Internet auctions
An efficient basis update for asymptotic linear programming
Solvable states in stochastic games
On undiscounted semi-Markov decision processes with absorbing states
A finite step algorithm via a bimatrix game to a single controller non-zero sum stochastic game
Bias optimality and strong \(n\) \((n= -1,0)\) discount optimality for Markov decision processes
Computational aspects in applied stochastic control
Unbounded dynamic programming via the Q-transform
An information-theoretic analysis of return maximization in reinforcement learning
A fuzzy approach to Markov decision processes with uncertain transition probabilities
Tauberian theorem for value functions
Marginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queue
Acceptable strategy profiles in stochastic games
Remarks on sensitive equilibria in stochastic games with additive reward and transition structure
Dynamic diagnostic and decision procedures under uncertainty
On efficiency of linear programming applied to discounted Markovian decision problems
Strong 1-optimal stationary policies in denumerable Markov decision processes
Stability-constrained Markov decision processes using MPC
Cyclic Markov equilibria in stochastic games
A canonical form for pencils of matrices with applications to asymptotic linear programs
Markovian sequential control processes. Denumerable state space
Conditions for existence of average and Blackwell optimal stationary policies in denumerable Markov decision processes
Communicating MDPs: Equivalence and LP properties
A pseudometric in supervisory control of probabilistic discrete event systems
On the solvability of Bellman's functional equations for Markov renewal programming
On canonical forms for zero-sum stochastic mean payoff games
A generalized inverse method for asymptotic linear programming
Discounting axioms imply risk neutrality
Computing semi-stationary optimal policies for multichain semi-Markov decision processes
Optimal eviction policies for stochastic address traces
The optimal frequency of information purchases
Dynamic competition with consumer inertia
An orderfield property for stochastic games when one player controls transition probabilities
Ordered field property for stochastic games when the player who controls transitions changes from state to state
Reachability and safety objectives in Markov decision processes on long but finite horizons
On optimality criteria for dynamic programs with long finite horizons
On Nash equilibria and improvement cycles in pure positional strategies for chess-like and backgammon-like \(n\)-person games
Invariant problems in dynamic programming - average reward criterion
Randomization and simplification in dynamic decision-making.
Resolvent expansions of matrices and applications
A decomposition algorithm for limiting average Markov decision problems.
Capital accumulation and the optimization of renewable resource models
Perfect equilibria in stochastic games
Control: a perspective
Optimal threshold probability in undiscounted Markov decision processes with a target set.
Dynamic priority allocation via restless bandit marginal productivity indices
Herbert Robbins and sequential analysis
Singulary perturbed Markov control problem: Limiting average cost
Denumerable semi-Markov decision chains with small interest rates
An elementary approach to discrete models of dividend strategies
Dynamic programming and Hamilton-Jacobi-Bellman equations on time scales
Nonlinear programming and stationary equilibria in stochastic games
Estimation and control in multichain processes
Policy improvement for perfect information additive reward and additive transition stochastic games with discounted and average payoffs
General limit value in dynamic programming
A note on the vanishing interest rate approach in average Markov decision chains with continuous and bounded costs
Quantum games: a review of the history, current state, and interpretation
A nested family of \(k\)-total effective rewards for positional games
The value functions of Markov decision processes
Review of a Markov decision algorithm for optimal inspections and revisions in a maintenance system with partial information
Optimal inspection policies for a manufacturing station
Some remarks on the new optimality criterion of Mine and Tabata
On the convergence of the average expected return in dynamic programming
A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
Semi-Markov decision processes with limiting ratio average rewards
On the existence of relative values for undiscounted multichain Markov decision processes
Continuous versus measurable recourse in N-stage stochastic programming
Continuous time control of Markov processes on an arbitrary state space: average return criterion
An optimality principle for Markovian decision processes
Optimal threshold probability and expectation in semi-Markov decision processes
Should I remember more than you? Best responses to factored strategies
Stochastic convex programming: Kuhn-Tucker conditions
Problemi di ottimizzazione nella teoria delle code
Semi-Markov strategies in stochastic games
Singularly perturbed linear programs and Markov decision processes
Foolproof convergence in multichain policy iteration
A survey of recent results on continuous-time Markov decision processes (with comments and rejoinder)
Optimization of stochastic maintenance policies
Sensitivity of finite Markov chains under perturbation
Stationary \(\varepsilon\)-optimal strategies in stochastic games
Bounded variation of \(\{V_n\}\) and its limit
Pure equilibria in a simple dynamic model of strategic market game
Admission control in a two-class loss system with periodically varying parameters and abandonments
Optimal replenishment for a periodic review inventory system with two supply modes.
Markov-type fuzzy decision processes with a discounted reward on a closed interval
Sequential identification and adaptive control in stochastic systems
Optimization models for the first arrival target distribution function in discrete time
Are limits of \(\alpha\)-discounted optimal policies Blackwell optimal? A counterexample
Controlled semi-Markov models under long-run average rewards
Long-term average cost control problems for continuous time Markov processes: A survey
Blackwell optimality in Markov decision processes with partial observation.
Sensitivity analysis in discounted Markovian decision problems
Controlled Markov set-chains under average criteria
Two-player stochastic games. II: The case of recursive games
Fuzzy decision processes with an average reward criterion.
Optimal search with positive switch cost is NP-hard
Decentralized evolutionary mechanisms for intertemporal economies: A possibility result
Exact formula for sensitivity analysis of Markov chains