scientific article; zbMATH DE number 700091

From MaRDI portal

Publication:4315289

zbMath: 0829.90134
MaRDI QID: Q4315289

Martin L. Puterman

Publication date: 6 December 1994

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)
Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)

Related Items (only showing first 100 items)

Sequential variable sampling plan for normal distribution
Optimization of a large-scale water reservoir network by stochastic dynamic programming with efficient state space discretization
On the optimality equation for average cost Markov control processes with Feller transition probabilities
Event-based optimization approach for solving stochastic decision problems with probabilistic constraint
Tweaking the odds in probabilistic timed automata
Evaluation and prediction of an optimal control in a processor sharing queueing system with heterogeneous servers
Runtime monitors for Markov decision processes
Model-free reinforcement learning for branching Markov decision processes
SIR dynamics with vaccination in a large configuration model
Probabilistic planning with clear preferences on missing information
Practical solution techniques for first-order MDPs
Zero-sum stochastic games with average payoffs: new optimality conditions
Online stochastic reservation systems
Strategy optimization for controlled Markov process with descriptive complexity constraint
Stochastic constraint programming: A scenario-based approach
Zero-sum continuous-time Markov games with unbounded transition and discounted payoff rates
On ordinal comparison of policies in Markov reward processes
The dynamic shortest path problem with anticipation
A fuzzy approach to Markov decision processes with uncertain transition probabilities
Means-end relations and a measure of efficacy
Marginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queue
Perfect information two-person zero-sum Markov games with imprecise transition probabilities
Keep or return? Managing ordering and return policies in start-up companies
Revenue management for a make-to-order company with limited inventory capacity
A formal mathematical framework for modeling probabilistic hybrid systems
A semimartingale characterization of average optimal stationary policies for Markov decision processes
Clinic scheduling models with overbooking for patients with heterogeneous no-show probabilities
Multi-objective optimization of water-using systems
Allocation of empty containers between multi-ports
Exact decomposition approaches for Markov decision processes: a survey
Interleaving solving and elicitation of constraint satisfaction problems based on expected cost
Risk-averse dynamic programming for Markov decision processes
The policy iteration algorithm for average continuous control of piecewise deterministic Markov processes
A variable neighborhood search based algorithm for finite-horizon Markov decision processes
Performance evaluation of direct heuristic dynamic programming using control-theoretic measures
Reducing reinforcement learning to KWIK online regression
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
On a multi-period supply chain system with supplementary order opportunity
Ranking policies in discrete Markov decision processes
Dynamic control of a single-server system with abandonments
Stochastic control via direct comparison
Time aggregated Markov decision processes via standard dynamic programming
Explicit solution of the average-cost optimality equation for a pest-control problem
Performance analysis for controlled semi-Markov systems with application to maintenance
Industry dynamics: foundations for models with an infinite number of firms
Completion-of-squares: revisited and extended
Using negotiable features for prescription problems
Specifying and computing preferred plans
The orienteering problem with stochastic travel and service times
A dynamic programming strategy to balance exploration and exploitation in the bandit problem
Optimization of heuristic search using recursive algorithm selection and reinforcement learning
Decentralized MDPs with sparse interactions
Discounted continuous-time constrained Markov decision processes in Polish spaces
Optimal resource allocation for multiqueue systems with a shared server pool
Approximation of Markov decision processes with general state space
Management of the risk of wind damage in forestry: a graph-based Markov decision process approach
Resource allocation in congested queueing systems with time-varying demand: an application to airport operations
Computing equilibria in discounted dynamic games
Analyzing anonymity attacks through noisy channels
Exact and approximate Nash equilibria in discounted Markov stopping games with terminal redemption
Integrating inventory control and capacity management at a maintenance service provider
Control-limit policies for a class of stopping time problems with termination restrictions
Value set iteration for two-person zero-sum Markov games
An exponential lower bound for Cunningham's rule
Continuous-time Markov decision processes with risk-sensitive finite-horizon cost criterion
Finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors
Quantitative model-checking of controlled discrete-time Markov processes
Policy iteration for robust nonstationary Markov decision processes
Pseudopolynomial iterative algorithm to solve total-payoff games and min-cost reachability games
Admission control in UMTS networks based on approximate dynamic programming
Performance optimization of semi-Markov decision processes with discounted-cost criteria
Finite approximation for finite-horizon continuous-time Markov decision processes
Meet your expectations with guarantees: beyond worst-case synthesis in quantitative games
Stochastic games with unbounded payoffs: applications to robust control in economics
Accuracy of fluid approximations to controlled birth-and-death processes: absorbing case
A policy iteration heuristic for constrained discounted controlled Markov chains
Semi-Markov control models with partially known holding times distribution: discounted and average criteria
Dynamic pricing and scheduling in a multi-class single-server queueing system
Dynamic resource allocation in a multi-product make-to-stock production system
Sampled fictitious play for approximate dynamic programming
General notions of indexability for queueing control and asset management
A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
Teaching randomized learners with feedback
Approximate dynamic programming via direct search in the space of value function approximations
A tractable discrete fractional programming: application to constrained assortment optimization
Stochastic decomposition applied to large-scale hydro valleys management
Optimal and heuristic policies for assemble-to-order systems with different review periods
A stochastic dynamic programming approach for delay management of a single train line
M/G/\(1\) queue with event-dependent arrival rates
Heuristic procedures for a stochastic batch service problem
Program repair without regret
Policy gradient in Lipschitz Markov decision processes
Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games
Optimality, equilibrium, and curb sets in decision problems without commitment
On essential information in sequential decision processes
On mean reward variance in semi-Markov processes
On the optimality of a full-service policy for a queueing system with discounted costs
Solving factored MDPs using non-homogeneous partitions
A multigenerational game model to analyze sustainable development
Constraint solving in uncertain and dynamic environments: A survey
