scientific article; zbMATH DE number 5685899

From MaRDI portal

Revision as of 23:03, 8 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:5305630

Jump to:navigation, search

zbMath1184.90170MaRDI QIDQ5305630

Martin L. Puterman

Publication date: 22 March 2010

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)

Related Items (only showing first 100 items - show all)

Multiple stopping time POMDPs: structural results \& application in interactive advertising on social media ⋮ Demand seasonality in retail inventory management ⋮ Robust decomposable Markov decision processes motivated by allocating school budgets ⋮ Solving dynamic public insurance games with endogenous agent distributions: theory and computational approximation ⋮ Revenue management for operations with urgent orders ⋮ Parametric replenishment policies for inventory systems with lost sales and fixed order cost ⋮ Optimal sensor scheduling for multiple linear dynamical systems ⋮ Optimal inventory management using retail prepacks ⋮ Fully probabilistic design of strategies with estimator ⋮ Comparing strategies to prevent stroke and ischemic heart disease in the Tunisian population: Markov modeling approach using a comprehensive sensitivity analysis algorithm ⋮ Solving stochastic resource-constrained project scheduling problems by closed-loop approximate dynamic programming ⋮ A two-state partially observable Markov decision process with three actions ⋮ Continue, quit, restart probability model ⋮ Perspectives of approximate dynamic programming ⋮ Optimal decisions for continuous time Markov decision processes over finite planning horizons ⋮ On transition matrices of Markov chains corresponding to Hamiltonian cycles ⋮ A model for equilibrium in some service-provider user-set interactions ⋮ Heuristic decision rules for short-term trading of renewable energy with co-located energy storage ⋮ Game theoretic interaction and decision: a quantum analysis ⋮ A stochastic game approach to the security issue of networked control systems under jamming attacks ⋮ Scheduling of multi-class multi-server queueing systems with abandonments ⋮ Frameworks and results in distributionally robust optimization ⋮ Robust optimal strategies in Markov decision problems ⋮ A non-penalty recurrent neural network for solving a class of constrained optimization problems ⋮ A multi-objective approach for PH-graphs with applications to stochastic shortest paths ⋮ On the computation of Whittle's index for Markovian restless bandits ⋮ Optimal supervisory control with mean payoff objectives and under partial observation ⋮ When are emptiness and containment decidable for probabilistic automata? ⋮ Bidding mechanisms in graph games ⋮ Stochastic reachability of a target tube: theory and computation ⋮ Efficient incremental planning and learning with multi-valued decision diagrams ⋮ Solving generic nonarchimedean semidefinite programs using stochastic game algorithms ⋮ Lost-sales inventory systems with a service level criterion ⋮ Improved utilization for joint HCCA-EDCA access in IEEE 802.11e WLANs ⋮ Infinite-duration poorman-bidding games ⋮ On budget balance of the dynamic pivot mechanism ⋮ Fuzzy Markovian decision processes: application to queueing systems ⋮ Bayesian optimistic Kullback-Leibler exploration ⋮ Optimal dynamic resource allocation to prevent defaults ⋮ Space-efficient scheduling of stochastically generated tasks ⋮ Renewable resource management with stochastic recharge and environmental threats ⋮ Offline reinforcement learning with task hierarchies ⋮ Integrating stochastic reasoning into Event-B development ⋮ Preference-based reinforcement learning: a formal framework and a policy iteration algorithm ⋮ On discounted dynamic programming with unbounded returns ⋮ Discounted dynamic programming with unbounded returns: application to economic models ⋮ A mean field approach for optimization in discrete time ⋮ The value function of an infinite-horizon single-item lot-sizing problem ⋮ Approximate dynamic programming for capacity allocation in the service industry ⋮ Optimal denial-of-service attack energy management against state estimation over an SINR-based network ⋮ Computational bounds for elevator control policies by large scale linear programming ⋮ Dynamic speed scaling minimizing expected energy consumption for real-time tasks ⋮ A necessary condition for Nash equilibrium in two-person zero-sum constrained stochastic games ⋮ A survey on skill-based routing with applications to service operations management ⋮ A stochastic approach to optimize Maritime pine (\textit{Pinus pinaster} Ait.) stand management scheduling under fire risk. An application in Portugal ⋮ Sensitivity-based nested partitions for solving finite-horizon Markov decision processes ⋮ Distributed adaptive dynamic programming for data-driven optimal control ⋮ Computation of weighted sums of rewards for concurrent MDPs ⋮ On the hardness of analyzing probabilistic programs ⋮ An approximate dynamic programming approach for sequential pig marketing decisions at herd level ⋮ A hybrid simulation-optimization algorithm for the Hamiltonian cycle problem ⋮ Optimal strategies for a fishery model applied to utility functions ⋮ Identifying proactive ICU patient admission, transfer and diversion policies in a public-private hospital network ⋮ Determining the optimal strategies for zero-sum average stochastic positional games ⋮ Attack allocation on remote state estimation in multi-systems: structural results and asymptotic solution ⋮ A policy iteration algorithm for the American put option and free boundary control problems ⋮ Model-based testing of probabilistic systems ⋮ An approximate dynamic programming approach to project scheduling with uncertain resource availabilities ⋮ A nested family of \(k\)-total effective rewards for positional games ⋮ Customizing exponential semi-Markov decision processes under the discounted cost criterion ⋮ Dynamic expediting of an urgent order with uncertain progress ⋮ Cooperation dynamics in repeated games of adverse selection ⋮ Production and availability policies through the Markov decision process and myopic methods for contractual and selective orders ⋮ A general approach for population games with application to vaccination ⋮ Determining the optimal strategies for discrete control problems on stochastic networks with discounted costs ⋮ Providing radiology health care services to stochastic demand of different customer classes ⋮ Light robustness in the optimization of Markov decision processes with uncertain parameters ⋮ Dynamic pricing in a production system with multiple demand classes ⋮ Sell or store? An ADP approach to marketing renewable energy ⋮ Erlang loss bounds for OT-ICU systems ⋮ An intelligent packet loss control heuristic for connectionless real-time voice communication ⋮ On infinite horizon active fault diagnosis for a class of non-linear non-Gaussian systems ⋮ Dual-based methods for solving infinite-horizon nonstationary deterministic dynamic programs ⋮ Markov decision processes with quasi-hyperbolic discounting ⋮ Optimal control in dynamic food supply chains using big data ⋮ What foreclosed homes should a municipality purchase to stabilize vulnerable neighborhoods? ⋮ Applications of stochastic modeling in air traffic management: methods, challenges and opportunities for solving air traffic problems under uncertainty ⋮ A stochastic dynamic pricing model for the multiclass problems in the airline industry ⋮ Stochastic dynamic programming model for optimal resource allocation in vehicular ad hoc networks ⋮ Analysis of customer lifetime value and marketing expenditure decisions through a Markovian-based model ⋮ Inferring expected runtimes of probabilistic integer programs using expected sizes ⋮ Condition-dependent mate choice: a stochastic dynamic programming approach ⋮ Asymptotically optimal index policies for an abandonment queue with convex holding cost ⋮ Equilibrium points and equilibrium sets of some \(GI /M/1\) queues ⋮ A pseudo-linear time algorithm for the optimal discrete speed minimizing energy consumption ⋮ The operator approach to entropy games ⋮ Optimal dynamic mining policy of blockchain selfish mining through sensitivity-based optimization ⋮ From reinforcement learning to optimal control: a unified framework for sequential decisions ⋮ Time and inventory dependent optimal maintenance policies for single machine workstations: an MDP approach ⋮ How adaptive and reliable is your program?

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5305630&oldid=19971670"