On the Theory of Dynamic Programming

From MaRDI portal

Publication:5812624

Jump to:navigation, search

DOI10.1073/pnas.38.8.716zbMath0047.13802OpenAlexW2056653303WikidataQ33712713 ScholiaQ33712713MaRDI QIDQ5812624

Richard Bellman

Publication date: 1952

Published in: Proceedings of the National Academy of Sciences (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1073/pnas.38.8.716

zbMATH Keywords

mathematical biology, operations research

Related Items

A current-value Hamiltonian approach to discrete-time optimal control problems in economic growth theory ⋮ On the Bellman's principle of optimality ⋮ Bovine mastitis and optimal disease management: dynamic programming analysis ⋮ Optimal life-insurance selection and purchase within a market of several life-insurance providers ⋮ Optimal switchover times between two activities utilizing the same resource ⋮ Computation of mutual information from hidden Markov models ⋮ A partial history of the early development of continuous-time nonlinear stochastic systems theory ⋮ Unnamed Item ⋮ Algebraic dynamic programming on trees ⋮ Asynchronous stochastic approximation with differential inclusions ⋮ Scaled relative graphs: nonexpansive operators via 2D Euclidean geometry ⋮ Model predictive control of cash balance in a cash concentration and disbursements system ⋮ On some variational problems occurring in the theory of dynamic programming ⋮ Active inference and agency: optimal control without cost functions ⋮ Multi-operator based biogeography based optimization with mutation for global numerical optimization ⋮ Optimal control for uncertain random singular systems with multiple time-delays ⋮ Deep policy dynamic programming for vehicle routing problems ⋮ Optimal assignment of sellers in a store with a random number of clientsviathe Armed Bandit model ⋮ Risk-averse dynamic programming for Markov decision processes ⋮ Reinforcement learning for combinatorial optimization: a survey ⋮ Dynamic programming for semi-Markov modulated SDEs ⋮ A survey of numerical solutions for stochastic control problems: some recent progress ⋮ Dynamic programming for a Markov-switching jump-diffusion ⋮ Direct and indirect optimal control applied to plant virus propagation with seasonality and delays ⋮ Pricing and risk of swing contracts in natural gas markets ⋮ Employing reinforcement learning to enhance particle swarm optimization methods ⋮ Optimal social welfare policy within financial and life insurance markets ⋮ Optimal investment strategies for pension funds with regulation-conform dynamic pension payment management in the absence of guarantees ⋮ Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function ⋮ Distorted probability operator for dynamic portfolio optimization in times of socio-economic crisis ⋮ Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization ⋮ Robust optimal control of logical control networks with function perturbation ⋮ An application of dynamic programming principle in corporate international optimal investment and consumption choice problem ⋮ Generating probabilistic safety guarantees for neural network controllers ⋮ Error estimates of finite element approximations for problems in linear elasticity. III: Problems in elastodynamics; discrete time approximations ⋮ Optimization of an economic ordering quantity model for non-instantaneous deteriorating items with ordering time constraint using dynamic programming ⋮ Entropy regularized actor-critic based multi-agent deep reinforcement learning for stochastic games ⋮ Neural network approximation and estimation of classifiers with classification boundary in a Barron class ⋮ Model-free policy iteration approach to NCE-based strategy design for linear quadratic Gaussian games ⋮ A modified Huber loss function for continual reassessment methods in clinical trials ⋮ Optimal harvesting for a logistic growth model with predation and a constant elasticity of variance ⋮ Time optimal control of triple integrator with input saturation and full state constraints ⋮ Probabilistically distorted risk-sensitive infinite-horizon dynamic programming ⋮ Active Inference, Curiosity and Insight ⋮ Scheduling results applicable to decision-theoretic troubleshooting ⋮ Optimization of market stochastic dynamics ⋮ Optimal feedback control for linear systems with input delays revisited ⋮ Sequential decision problems on isolated time domains ⋮ Two-phase selective decentralization to improve reinforcement learning systems with MDP ⋮ Stochastic finite-state systems in control theory ⋮ Planning and navigation as active inference ⋮ Controlled Markov decision processes with AVaR criteria for unbounded costs ⋮ Symbolic approximate time-optimal control ⋮ The genesis of differential games in light of Isaacs' contributions ⋮ Comparative calculation of the fuel-optimal operating strategy for diesel hybrid railway vehicles ⋮ Minimum cost path problems with relays ⋮ Dynamic programming algorithms for computing power indices in weighted multi-tier games ⋮ Efficient approximation of solutions of parametric linear transport equations by ReLU DNNs ⋮ A perturb biogeography based optimization with mutation for global numerical optimization ⋮ Robust optimal control using conditional risk mappings in infinite horizon ⋮ Convergence of the control parametrization Ritz method for nonlinear optimal control problems ⋮ Continuous-Time Robust Dynamic Programming ⋮ Stochastic dynamic programming illuminates the link between environment, physiology, and evolution ⋮ Numerical solution of the parametric diffusion equation by deep neural networks ⋮ The optimization of K-effect models by linear and dynamic programming ⋮ A novel hybrid differential evolution and particle swarm optimization algorithm for unconstrained optimization ⋮ Some Functional Equations in the Theory of Dynamic Programming. I. Functions of Points and Point Transformations ⋮ A new analytical method for solving a class of nonlinear optimal control problems ⋮ Data-driven optimal control with a relaxed linear program ⋮ Free energy, value, and attractors ⋮ The theory of dynamic programming ⋮ Nonlinear optimal control: a numerical scheme based on occupation measures and interval analysis ⋮ Active Inference: Demystified and Compared ⋮ Sophisticated Inference ⋮ Verallgemeinerung des Lemmas von Gronwall und Bellman ⋮ Optimal Control with Budget Constraints and Resets ⋮ Efficient Estimation of Optimal Regimes Under a No Direct Effect Assumption ⋮ Tractable minor-free generalization of planar zero-field Ising models ⋮ System planning and configuration problems for optimal system design ⋮ Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges ⋮ High-order fully actuated system approaches: Part I. Models and basic procedure ⋮ Peril, prudence and planning as risk, avoidance and worry ⋮ An Option-Based Operational Risk Management Model for Pandemics ⋮ High-order fully actuated system approaches: Part VIII. Optimal control with application in spacecraft attitude stabilisation ⋮ A New Function Space from Barron Class and Application to Neural Network Approximation

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5812624&oldid=30624810"