scientific article

From MaRDI portal

Publication:3245701

Jump to:navigation, search

zbMath0078.34101MaRDI QIDQ3245701

Richard Bellman

Publication date: 1957

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

Mathematics Subject Classification ID

Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40)

Related Items (max. 100)

Deep reinforcement learning in finite-horizon to explore the most probable transition pathway ⋮ Certified reinforcement learning with logic guidance ⋮ Mixed nondeterministic-probabilistic automata: blending graphical probabilistic models with nondeterminism ⋮ On how to exploit a population given by a difference equation with random parameters ⋮ A denotational semantics for low-level probabilistic programs with nondeterminism ⋮ Computational aspects in applied stochastic control ⋮ Preference change ⋮ A novel state-transition forest: pricing corporate securities with intertemporal exercise policies and corresponding capital structure changes ⋮ A methodology for computation reduction for specially structured large scale Markov decision problems ⋮ Unnamed Item ⋮ A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning ⋮ Design and evaluation of norm-aware agents based on normative Markov decision processes ⋮ On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes ⋮ Restricted gradient-descent algorithm for value-function approximation in reinforcement learning ⋮ Dynamic diagnostic and decision procedures under uncertainty ⋮ Model Checking Linear-Time Properties of Probabilistic Systems ⋮ Stable sequential control rules and Markov chains ⋮ Markovian sequential control processes. Denumerable state space ⋮ Finite-Memory Strategies in POMDPs with Long-Run Average Objectives ⋮ The browser war -- analysis of Markov perfect equilibrium in markets with dynamic demand effects ⋮ Computing Behavioral Relations for Probabilistic Concurrent Systems ⋮ Computing semi-stationary optimal policies for multichain semi-Markov decision processes ⋮ Reinforcement learning for combinatorial optimization: a survey ⋮ Unnamed Item ⋮ SHORTFALL RISK MINIMIZATION UNDER FIXED TRANSACTION COSTS ⋮ Optimal strategies in the fighting fantasy gaming system: influencing stochastic dynamics by gambling with limited resource ⋮ Dynamic dispatching and repositioning policies for fast-response service networks ⋮ A human-robot collaborative reinforcement learning algorithm ⋮ Value-Gradient Based Formulation of Optimal Control Problem and Machine Learning Algorithm ⋮ Optimal management of stochastic invasion in a metapopulation with Allee effects ⋮ A Markovian decision model of adaptive cancer treatment and quality of life ⋮ Algebraic optimization of sequential decision problems ⋮ Pricing tenure payment reverse mortgages with optimal exercised prepayment options by accounting for house prices, interest rates, and mortality risk ⋮ Approximate Newton Policy Gradient Algorithms ⋮ Quantitative controller synthesis for consumption Markov decision processes ⋮ Human-cyber-physical automata and their synthesis ⋮ The method of value oriented successive approximations for the average reward Markov decision process ⋮ Unnamed Item ⋮ A survey of average cost problems in deterministic discrete-time control systems ⋮ Unnamed Item ⋮ Closed-loop supply chain inventory management with recovery information of reusable containers ⋮ Unnamed Item ⋮ Polynomial Approximation of High-Dimensional Hamilton--Jacobi--Bellman Equations and Applications to Feedback Control of Semilinear Parabolic PDEs ⋮ An intelligent choice of witnesses in the Miller-Rabin primality test. Reinforcement learning approach ⋮ Reinforcement learning for optimal error correction of toric codes ⋮ Value set iteration for Markov decision processes ⋮ Unnamed Item ⋮ OL-DEC-MDP model for multiagent online scheduling with a time-dependent probability of success ⋮ SLAP: specification logic of actions with probability ⋮ Control: a perspective ⋮ Unnamed Item ⋮ Lexicographic refinements in stationary possibilistic Markov decision processes ⋮ OPTIMALLY REPLACING MULTIPLE SYSTEMS IN A SHARED ENVIRONMENT ⋮ A review of operations research models in invasive species management: state of the art, challenges, and future directions ⋮ Unnamed Item ⋮ Dynamic lookahead policies for stochastic-dynamic inventory routing in bike sharing systems ⋮ Stochastic finite-state systems in control theory ⋮ Unnamed Item ⋮ Meta-modeling game for deriving theory-consistent, microstructure-based traction-separation laws via deep reinforcement learning ⋮ Structures and methods of dynamical decision-making ⋮ Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: the fixed-point approach revisited ⋮ A review on deep reinforcement learning for fluid mechanics ⋮ Quantitative model-checking of controlled discrete-time Markov processes ⋮ Control of chaotic systems by deep reinforcement learning ⋮ Probabilistic timed graph transformation systems ⋮ Dynamic journeying under uncertainty ⋮ Engineering constraint solvers for automatic analysis of probabilistic hybrid automata ⋮ An optimality principle for Markovian decision processes ⋮ Solving stochastic dynamic programming problems by linear programming — An annotated bibliography ⋮ Pursuit of food \textit{versus} pursuit of information in a Markovian perception-action loop model of foraging ⋮ Unnamed Item ⋮ Contraction mappings underlying undiscounted Markov decision problems ⋮ Stochastic revision opportunities in Markov decision problems ⋮ Elaboration Tolerant Representation of Markov Decision Process via Decision-Theoretic Extension of Probabilistic Action Language + ⋮ Discrete Dividend Payments in Continuous Time ⋮ Conditional Probabilities over Probabilistic and Nondeterministic Systems ⋮ Unnamed Item ⋮ The optimization of K-effect models by linear and dynamic programming ⋮ Linear programming considerations on Markovian decision processes with no discounting ⋮ Strong Uniform Value in Gambling Houses and Partially Observable Markov Decision Processes ⋮ Recomposable restricted finite state machines: definition and solution approaches ⋮ Dynamic programming and optimal control of variable multichannel stochastic service systems with applications ⋮ Linear programming algorithms for semi-Markovian decision processes ⋮ MAXIMIZING THE GROWTH RATE UNDER RISK CONSTRAINTS ⋮ Unnamed Item ⋮ On a set of optimal policies in continuous time Markovian decision problem ⋮ New classes of stochastic control processes ⋮ Stage-\(t\) scenario dominance for risk-averse multi-stage stochastic mixed-integer programs ⋮ History-dependent Evaluations in Partially Observable Markov Decision Process ⋮ Ultimate precision of joint parameter estimation under noisy Gaussian environment ⋮ Belief base contraction by belief accrual ⋮ Functional equations in the theory of dynamic programming. XI: Limit theorems ⋮ On the solvability of Bellman's functional equation for a Markovian decision process ⋮ Learning with policy prediction in continuous state-action multi-agent decision processes ⋮ Solving sequential collective decision problems under qualitative uncertainty ⋮ Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms ⋮ VPint: value propagation-based spatial interpolation ⋮ Optimal and near-optimal incentive strategies in the hierarchical control of Markov chains ⋮ Quantifying quantum correlations in noisy Gaussian channels ⋮ Explainable dynamic programming

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3245701&oldid=16375348"