scientific article; zbMATH DE number 5685899

From MaRDI portal
Publication:5305630

zbMath1184.90170MaRDI QIDQ5305630

Martin L. Puterman

Publication date: 22 March 2010


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (showing only first 100 - show all)

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpointsUnnamed ItemMarkov decision processes for infinite horizon problems solved with the cosine simplex methodAn Incremental Fast Policy Search Using a Single Sample PathUnnamed ItemUnnamed ItemA forwards induction approach to candidate drug selectionExperimental Design for Partially Observed Markov Decision ProcessesUnnamed ItemAn Approach for Determining Stationary Equilibria in a Single-Controller Average Stochastic GameUnnamed ItemAverage Cost Brownian Drift Control with Proportional Changeover CostsStationary policies for lower bounds on the minimum average cost of discrete-time nonlinear control systemsComputing Behavioral Relations for Probabilistic Concurrent SystemsMulti-hop sensor network scheduling for optimal remote estimationA Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and StabilizationEffective Scenarios in Multistage Distributionally Robust Optimization with a Focus on Total Variation DistanceUnnamed ItemA queueing model for customer rescheduling and no-shows in service systemsA numerical study of Markov decision process algorithms for multi-component replacement problemsUnnamed ItemAnother set of verifiable conditions for average Markov decision processes with Borel spacesAn exponential lower bound for Zadeh's pivot ruleA scalable anticipatory policy for the dynamic pickup and delivery problemOff-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demandsOPTIMAL CONTROL OF A TWO-SERVER QUEUEING SYSTEM WITH FAILURESFormalization of methods for the development of autonomous artificial intelligence systemsOptimal policies for stochastic clearing systems with time‐dependent delay penaltiesBlock Policy Mirror DescentModel checking differentially private propertiesA unified algorithm framework for mean-variance optimization in discounted Markov decision processesSmoothing policies and safe policy gradientsA dynamic analytic method for risk-aware controlled martingale problemsA specification logic for programs in the probabilistic guarded command languageTask allocation and on-the-job trainingUnnamed ItemA framework to measure the robustness of programs in the unpredictable environmentOPTIMIZATION OF OVERFLOW POLICIES IN CALL CENTERSOptimal Routing of Fixed Size Jobs to Two Parallel ServersAdaptive constraint satisfaction for Markov decision process congestion games: application to transportation networksAverage cost minimization in a multi-server retrial queueing system with a controllable reserve group of serversPremium control with reinforcement learningOPTIMALLY REPLACING MULTIPLE SYSTEMS IN A SHARED ENVIRONMENTDistributionally Robust Partially Observable Markov Decision Process with Moment-Based AmbiguityOn the Value Function of the M/G/1 FCFS and LCFS QueuesLearning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular ConstraintsUnnamed ItemUnnamed ItemIterative Improvement of Lower and Upper Bounds for Backward SDEsScheduling services in a queuing system with impatience and setup costsDynamic Pricing with a Poisson Bandit ModelUnnamed ItemUnnamed ItemUnnamed ItemA CTMDP-Based Exact Method for RCPSP with Uncertain Activity Durations and ReworkUnnamed ItemOptimal Kullback–Leibler approximation of Markov chains via nuclear norm regularisationDynamic Decision Making in Energy Systems with Storage and Renewable Energy SourcesMinimising average passenger waiting time in personal rapid transit systemsFast value iteration: an application of Legendre-Fenchel duality to a class of deterministic dynamic programming problems in discrete timeUnnamed ItemCharacterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision ChainsUnnamed ItemA Survey of Bidding Games on Graphs (Invited Paper)A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs<html> Nash &epsilon;-equilibria for stochastic games with total reward functions: an approach through Markov decision processes</html>A Continuous-Time Markov Decision Process for Infrastructure SurveillanceTo wait or not to wait: Optimal ordering under lead time uncertainty and forecast updatingRepeated Sequential Prisoner's Dilemma: The Stackleberg VariantSolving the drift control problemSynchronization and control in intrinsic and designed computation: An information-theoretic analysis of competing models of stochastic computationEmpirical Q-Value IterationA Convex Analytic Approach to Risk-Aware Markov Decision ProcessesConcurrent MDPs with Finite Markovian PoliciesMultiply Accelerated Value Iteration for NonSymmetric Affine Fixed Point Problems and Application to Markov Decision ProcessesUnnamed ItemOn Nash Equilibria in Stochastic Positional Games with Average PayoffsUnnamed ItemMultiple stopping time POMDPs: structural results \& application in interactive advertising on social mediaDemand seasonality in retail inventory managementRobust decomposable Markov decision processes motivated by allocating school budgetsSolving dynamic public insurance games with endogenous agent distributions: theory and computational approximationRevenue management for operations with urgent ordersParametric replenishment policies for inventory systems with lost sales and fixed order costOptimal sensor scheduling for multiple linear dynamical systemsOptimal inventory management using retail prepacksFully probabilistic design of strategies with estimatorComparing strategies to prevent stroke and ischemic heart disease in the Tunisian population: Markov modeling approach using a comprehensive sensitivity analysis algorithmSolving stochastic resource-constrained project scheduling problems by closed-loop approximate dynamic programmingA two-state partially observable Markov decision process with three actionsContinue, quit, restart probability modelPerspectives of approximate dynamic programmingOptimal decisions for continuous time Markov decision processes over finite planning horizonsOn transition matrices of Markov chains corresponding to Hamiltonian cyclesA model for equilibrium in some service-provider user-set interactionsHeuristic decision rules for short-term trading of renewable energy with co-located energy storageGame theoretic interaction and decision: a quantum analysisA stochastic game approach to the security issue of networked control systems under jamming attacksScheduling of multi-class multi-server queueing systems with abandonmentsFrameworks and results in distributionally robust optimization




This page was built for publication: