scientific article; zbMATH DE number 700091

From MaRDI portal
Revision as of 20:23, 6 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:4315289

zbMath0829.90134MaRDI QIDQ4315289

Martin L. Puterman

Publication date: 6 December 1994


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (only showing first 100 items - show all)

Sequential variable sampling plan for normal distributionOptimization of a large-scale water reservoir network by stochastic dynamic programming with efficient state space discretizationOn the optimality equation for average cost Markov control processes with Feller transition probabilitiesEvent-based optimization approach for solving stochastic decision problems with probabilistic constraintTweaking the odds in probabilistic timed automataEvaluation and prediction of an optimal control in a processor sharing queueing system with heterogeneous serversRuntime monitors for Markov decision processesModel-free reinforcement learning for branching Markov decision processesSIR dynamics with vaccination in a large configuration modelProbabilistic planning with clear preferences on missing informationPractical solution techniques for first-order MDPsZero-sum stochastic games with average payoffs: new optimality conditionsOnline stochastic reservation systemsStrategy optimization for controlled Markov process with descriptive complexity constraintStochastic constraint programming: A scenario-based approachZero-sum continuous-time Markov games with unbounded transition and discounted payoff ratesOn ordinal comparison of policies in Markov reward processesThe dynamic shortest path problem with anticipationA fuzzy approach to Markov decision processes with uncertain transition probabilitiesMeans-end relations and a measure of efficacyMarginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queuePerfect information two-person zero-sum Markov games with imprecise transition probabilitiesKeep or return? Managing ordering and return policies in start-up companiesRevenue management for a make-to-order company with limited inventory capacityA formal mathematical framework for modeling probabilistic hybrid systemsA semimartingale characterization of average optimal stationary policies for Markov decision processesClinic scheduling models with overbooking for patients with heterogeneous no-show probabilitiesMulti-objective optimization of water-using systemsAllocation of empty containers between multi-portsExact decomposition approaches for Markov decision processes: a surveyInterleaving solving and elicitation of constraint satisfaction problems based on expected costRisk-averse dynamic programming for Markov decision processesThe policy iteration algorithm for average continuous control of piecewise deterministic Markov processesA variable neighborhood search based algorithm for finite-horizon Markov decision processesPerformance evaluation of direct heuristic dynamic programming using control-theoretic measuresReducing reinforcement learning to KWIK online regressionAn actor-critic algorithm with function approximation for discounted cost constrained Markov decision processesOn a multi-period supply chain system with supplementary order opportunityRanking policies in discrete Markov decision processesDynamic control of a single-server system with abandonmentsStochastic control via direct comparisonTime aggregated Markov decision processes via standard dynamic programmingExplicit solution of the average-cost optimality equation for a pest-control problemPerformance analysis for controlled semi-Markov systems with application to maintenanceIndustry dynamics: foundations for models with an infinite number of firmsCompletion-of-squares: revisited and extendedUsing negotiable features for prescription problemsSpecifying and computing preferred plansThe orienteering problem with stochastic travel and service timesA dynamic programming strategy to balance exploration and exploitation in the bandit problemOptimization of heuristic search using recursive algorithm selection and reinforcement learningDecentralized MDPs with sparse interactionsDiscounted continuous-time constrained Markov decision processes in Polish spacesOptimal resource allocation for multiqueue systems with a shared server poolApproximation of Markov decision processes with general state spaceManagement of the risk of wind damage in forestry: a graph-based Markov decision process approachResource allocation in congested queueing systems with time-varying demand: an application to airport operationsComputing equilibria in discounted dynamic gamesAnalyzing anonymity attacks through noisy channelsExact and approximate Nash equilibria in discounted Markov stopping games with terminal redemptionIntegrating inventory control and capacity management at a maintenance service providerControl-limit policies for a class of stopping time problems with termination restrictionsValue set iteration for two-person zero-sum Markov gamesAn exponential lower bound for Cunningham's ruleContinuous-time Markov decision processes with risk-sensitive finite-horizon cost criterionFinite approximation of the first passage models for discrete-time Markov decision processes with varying discount factorsQuantitative model-checking of controlled discrete-time Markov processesPolicy iteration for robust nonstationary Markov decision processesPseudopolynomial iterative algorithm to solve total-payoff games and min-cost reachability gamesAdmission control in UMTS networks based on approximate dynamic programmingPerformance optimization of semi-Markov decision processes with discounted-cost criteriaFinite approximation for finite-horizon continuous-time Markov decision processesMeet your expectations with guarantees: beyond worst-case synthesis in quantitative gamesStochastic games with unbounded payoffs: applications to robust control in economicsAccuracy of fluid approximations to controlled birth-and-death processes: absorbing caseA policy iteration heuristic for constrained discounted controlled Markov chainsSemi-Markov control models with partially known holding times distribution: discounted and average criteriaDynamic pricing and scheduling in a multi-class single-server queueing systemDynamic resource allocation in a multi-product make-to-stock production systemSampled fictitious play for approximate dynamic programmingGeneral notions of indexability for queueing control and asset managementA unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain casesTeaching randomized learners with feedbackApproximate dynamic programming via direct search in the space of value function approximationsA tractable discrete fractional programming: application to constrained assortment optimizationStochastic decomposition applied to large-scale hydro valleys managementOptimal and heuristic policies for assemble-to-order systems with different review periodsA stochastic dynamic programming approach for delay management of a single train lineM/G/\(1\) queue with event-dependent arrival ratesHeuristic procedures for a stochastic batch service problemProgram repair without regretPolicy gradient in Lipschitz Markov decision processesFinite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov gamesOptimality, equilibrium, and curb sets in decision problems without commitmentOn essential information in sequential decision processesOn mean reward variance in semi-Markov processesOn the optimality of a full-service policy for a queueing system with discounted costsSolving factored MDPs using non-homogeneous partitionsA multigenerational game model to analyze sustainable developmentConstraint solving in uncertain and dynamic environments: A survey







This page was built for publication: