scientific article; zbMATH DE number 1455133
From MaRDI portal
Publication:4484923
zbMath0955.91009MaRDI QIDQ4484923
Publication date: 5 June 2000
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Markov decision processesoptimal stoppingnetworksstochastic modelsstatistical decisionsaverage costsmulti-stage decision problems
Decision theory (91B06) Dynamic programming (90C39) Introductory exposition (textbooks, tutorial papers, etc.) pertaining to operations research and mathematical programming (90-01) Introductory exposition (textbooks, tutorial papers, etc.) pertaining to game theory, economics, and finance (91-01)
Related Items (13)
Tree-based reinforcement learning for estimating optimal dynamic treatment regimes ⋮ Using geometric extrema for segment-to-segment characteristics comparison in online signature verification ⋮ Randomized Shortest-Path Problems: Two Related Models ⋮ Optimal intervention for an epidemic model under parameter uncertainty ⋮ Adaptive contrast weighted learning for multi‐stage multi‐treatment decision‐making ⋮ Estimating Tree-Based Dynamic Treatment Regimes Using Observational Data with Restricted Treatment Sequences ⋮ Reinforced Risk Prediction With Budget Constraint Using Irregularly Measured Data From Electronic Health Records ⋮ Markov decision processes on Borel spaces with total cost and random horizon ⋮ Optimal Dynamic Treatment Regimes ⋮ SINGLE VEHICLE ROUTING PROBLEMS WITH A PREDEFINED CUSTOMER ORDER, UNIFIED LOAD AND STOCHASTIC DISCRETE DEMANDS ⋮ A semi-Markov decision model for the optimal control of a simple immigration-birth-death process through the introduction of a predator ⋮ \(Q\)- and \(A\)-learning methods for estimating optimal dynamic treatment regimes ⋮ Finite and infinite-horizon single vehicle routing problems with a predefined customer sequence and pickup and delivery
This page was built for publication: