scientific article; zbMATH DE number 5685899

From MaRDI portal

Publication:5305630


zbMath: 1184.90170, MaRDI QID: Q5305630

Martin L. Puterman

Publication date: 22 March 2010

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40) ⋮ Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)

Related Items (showing only first 100)

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints ⋮ Unnamed Item ⋮ Markov decision processes for infinite horizon problems solved with the cosine simplex method ⋮ An Incremental Fast Policy Search Using a Single Sample Path ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A forwards induction approach to candidate drug selection ⋮ Experimental Design for Partially Observed Markov Decision Processes ⋮ Unnamed Item ⋮ An Approach for Determining Stationary Equilibria in a Single-Controller Average Stochastic Game ⋮ Unnamed Item ⋮ Average Cost Brownian Drift Control with Proportional Changeover Costs ⋮ Stationary policies for lower bounds on the minimum average cost of discrete-time nonlinear control systems ⋮ Computing Behavioral Relations for Probabilistic Concurrent Systems ⋮ Multi-hop sensor network scheduling for optimal remote estimation ⋮ A Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization ⋮ Effective Scenarios in Multistage Distributionally Robust Optimization with a Focus on Total Variation Distance ⋮ Unnamed Item ⋮ A queueing model for customer rescheduling and no-shows in service systems ⋮ A numerical study of Markov decision process algorithms for multi-component replacement problems ⋮ Unnamed Item ⋮ Another set of verifiable conditions for average Markov decision processes with Borel spaces ⋮ An exponential lower bound for Zadeh's pivot rule ⋮ A scalable anticipatory policy for the dynamic pickup and delivery problem ⋮ Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands ⋮ OPTIMAL CONTROL OF A TWO-SERVER QUEUEING SYSTEM WITH FAILURES ⋮ Formalization of methods for the development of autonomous artificial intelligence systems ⋮ Optimal policies for stochastic clearing systems with time‐dependent delay penalties ⋮ Block Policy Mirror Descent ⋮ Model checking differentially private properties ⋮ A unified algorithm framework for mean-variance optimization in discounted Markov decision processes ⋮ Smoothing policies and safe policy gradients ⋮ A dynamic analytic method for risk-aware controlled martingale problems ⋮ A specification logic for programs in the probabilistic guarded command language ⋮ Task allocation and on-the-job training ⋮ Unnamed Item ⋮ A framework to measure the robustness of programs in the unpredictable environment ⋮ OPTIMIZATION OF OVERFLOW POLICIES IN CALL CENTERS ⋮ Optimal Routing of Fixed Size Jobs to Two Parallel Servers ⋮ Adaptive constraint satisfaction for Markov decision process congestion games: application to transportation networks ⋮ Average cost minimization in a multi-server retrial queueing system with a controllable reserve group of servers ⋮ Premium control with reinforcement learning ⋮ OPTIMALLY REPLACING MULTIPLE SYSTEMS IN A SHARED ENVIRONMENT ⋮ Distributionally Robust Partially Observable Markov Decision Process with Moment-Based Ambiguity ⋮ On the Value Function of the M/G/1 FCFS and LCFS Queues ⋮ Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Iterative Improvement of Lower and Upper Bounds for Backward SDEs ⋮ Scheduling services in a queuing system with impatience and setup costs ⋮ Dynamic Pricing with a Poisson Bandit Model ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A CTMDP-Based Exact Method for RCPSP with Uncertain Activity Durations and Rework ⋮ Unnamed Item ⋮ Optimal Kullback–Leibler approximation of Markov chains via nuclear norm regularisation ⋮ Dynamic Decision Making in Energy Systems with Storage and Renewable Energy Sources ⋮ Minimising average passenger waiting time in personal rapid transit systems ⋮ Fast value iteration: an application of Legendre-Fenchel duality to a class of deterministic dynamic programming problems in discrete time ⋮ Unnamed Item ⋮ Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains ⋮ Unnamed Item ⋮ A Survey of Bidding Games on Graphs (Invited Paper) ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs ⋮ Nash ε-equilibria for stochastic games with total reward functions: an approach through Markov decision processes ⋮ A Continuous-Time Markov Decision Process for Infrastructure Surveillance ⋮ To wait or not to wait: Optimal ordering under lead time uncertainty and forecast updating ⋮ Repeated Sequential Prisoner's Dilemma: The Stackleberg Variant ⋮ Solving the drift control problem ⋮ Synchronization and control in intrinsic and designed computation: An information-theoretic analysis of competing models of stochastic computation ⋮ Empirical Q-Value Iteration ⋮ A Convex Analytic Approach to Risk-Aware Markov Decision Processes ⋮ Concurrent MDPs with Finite Markovian Policies ⋮ Multiply Accelerated Value Iteration for NonSymmetric Affine Fixed Point Problems and Application to Markov Decision Processes ⋮ Unnamed Item ⋮ On Nash Equilibria in Stochastic Positional Games with Average Payoffs ⋮ Unnamed Item ⋮ Multiple stopping time POMDPs: structural results & application in interactive advertising on social media ⋮ Demand seasonality in retail inventory management ⋮ Robust decomposable Markov decision processes motivated by allocating school budgets ⋮ Solving dynamic public insurance games with endogenous agent distributions: theory and computational approximation ⋮ Revenue management for operations with urgent orders ⋮ Parametric replenishment policies for inventory systems with lost sales and fixed order cost ⋮ Optimal sensor scheduling for multiple linear dynamical systems ⋮ Optimal inventory management using retail prepacks ⋮ Fully probabilistic design of strategies with estimator ⋮ Comparing strategies to prevent stroke and ischemic heart disease in the Tunisian population: Markov modeling approach using a comprehensive sensitivity analysis algorithm ⋮ Solving stochastic resource-constrained project scheduling problems by closed-loop approximate dynamic programming ⋮ A two-state partially observable Markov decision process with three actions ⋮ Continue, quit, restart probability model ⋮ Perspectives of approximate dynamic programming ⋮ Optimal decisions for continuous time Markov decision processes over finite planning horizons ⋮ On transition matrices of Markov chains corresponding to Hamiltonian cycles ⋮ A model for equilibrium in some service-provider user-set interactions ⋮ Heuristic decision rules for short-term trading of renewable energy with co-located energy storage ⋮ Game theoretic interaction and decision: a quantum analysis ⋮ A stochastic game approach to the security issue of networked control systems under jamming attacks ⋮ Scheduling of multi-class multi-server queueing systems with abandonments ⋮ Frameworks and results in distributionally robust optimization

