On the Theory of Dynamic Programming

From MaRDI portal
Publication:5812624

DOI10.1073/pnas.38.8.716zbMath0047.13802OpenAlexW2056653303WikidataQ33712713 ScholiaQ33712713MaRDI QIDQ5812624

Richard Bellman

Publication date: 1952

Published in: Proceedings of the National Academy of Sciences (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1073/pnas.38.8.716




Related Items

A current-value Hamiltonian approach to discrete-time optimal control problems in economic growth theoryOn the Bellman's principle of optimalityBovine mastitis and optimal disease management: dynamic programming analysisOptimal life-insurance selection and purchase within a market of several life-insurance providersOptimal switchover times between two activities utilizing the same resourceComputation of mutual information from hidden Markov modelsA partial history of the early development of continuous-time nonlinear stochastic systems theoryUnnamed ItemAlgebraic dynamic programming on treesAsynchronous stochastic approximation with differential inclusionsScaled relative graphs: nonexpansive operators via 2D Euclidean geometryModel predictive control of cash balance in a cash concentration and disbursements systemOn some variational problems occurring in the theory of dynamic programmingActive inference and agency: optimal control without cost functionsMulti-operator based biogeography based optimization with mutation for global numerical optimizationOptimal control for uncertain random singular systems with multiple time-delaysDeep policy dynamic programming for vehicle routing problemsOptimal assignment of sellers in a store with a random number of clientsviathe Armed Bandit modelRisk-averse dynamic programming for Markov decision processesReinforcement learning for combinatorial optimization: a surveyDynamic programming for semi-Markov modulated SDEsA survey of numerical solutions for stochastic control problems: some recent progressDynamic programming for a Markov-switching jump-diffusionDirect and indirect optimal control applied to plant virus propagation with seasonality and delaysPricing and risk of swing contracts in natural gas marketsEmploying reinforcement learning to enhance particle swarm optimization methodsOptimal social welfare policy within financial and life insurance marketsOptimal investment strategies for pension funds with regulation-conform dynamic pension payment management in the absence of guaranteesAdaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier functionDistorted probability operator for dynamic portfolio optimization in times of socio-economic crisisFast Global Convergence of Natural Policy Gradient Methods with Entropy RegularizationRobust optimal control of logical control networks with function perturbationAn application of dynamic programming principle in corporate international optimal investment and consumption choice problemGenerating probabilistic safety guarantees for neural network controllersError estimates of finite element approximations for problems in linear elasticity. III: Problems in elastodynamics; discrete time approximationsOptimization of an economic ordering quantity model for non-instantaneous deteriorating items with ordering time constraint using dynamic programmingEntropy regularized actor-critic based multi-agent deep reinforcement learning for stochastic gamesNeural network approximation and estimation of classifiers with classification boundary in a Barron classModel-free policy iteration approach to NCE-based strategy design for linear quadratic Gaussian gamesA modified Huber loss function for continual reassessment methods in clinical trialsOptimal harvesting for a logistic growth model with predation and a constant elasticity of varianceTime optimal control of triple integrator with input saturation and full state constraintsProbabilistically distorted risk-sensitive infinite-horizon dynamic programmingActive Inference, Curiosity and InsightScheduling results applicable to decision-theoretic troubleshootingOptimization of market stochastic dynamicsOptimal feedback control for linear systems with input delays revisitedSequential decision problems on isolated time domainsTwo-phase selective decentralization to improve reinforcement learning systems with MDPStochastic finite-state systems in control theoryPlanning and navigation as active inferenceControlled Markov decision processes with AVaR criteria for unbounded costsSymbolic approximate time-optimal controlThe genesis of differential games in light of Isaacs' contributionsComparative calculation of the fuel-optimal operating strategy for diesel hybrid railway vehiclesMinimum cost path problems with relaysDynamic programming algorithms for computing power indices in weighted multi-tier gamesEfficient approximation of solutions of parametric linear transport equations by ReLU DNNsA perturb biogeography based optimization with mutation for global numerical optimizationRobust optimal control using conditional risk mappings in infinite horizonConvergence of the control parametrization Ritz method for nonlinear optimal control problemsContinuous-Time Robust Dynamic ProgrammingStochastic dynamic programming illuminates the link between environment, physiology, and evolutionNumerical solution of the parametric diffusion equation by deep neural networksThe optimization of K-effect models by linear and dynamic programmingA novel hybrid differential evolution and particle swarm optimization algorithm for unconstrained optimizationSome Functional Equations in the Theory of Dynamic Programming. I. Functions of Points and Point TransformationsA new analytical method for solving a class of nonlinear optimal control problemsData-driven optimal control with a relaxed linear programFree energy, value, and attractorsThe theory of dynamic programmingNonlinear optimal control: a numerical scheme based on occupation measures and interval analysisActive Inference: Demystified and ComparedSophisticated InferenceVerallgemeinerung des Lemmas von Gronwall und BellmanOptimal Control with Budget Constraints and ResetsEfficient Estimation of Optimal Regimes Under a No Direct Effect AssumptionTractable minor-free generalization of planar zero-field Ising modelsSystem planning and configuration problems for optimal system designMulti-armed bandit models for the optimal design of clinical trials: benefits and challengesHigh-order fully actuated system approaches: Part I. Models and basic procedurePeril, prudence and planning as risk, avoidance and worryAn Option-Based Operational Risk Management Model for PandemicsHigh-order fully actuated system approaches: Part VIII. Optimal control with application in spacecraft attitude stabilisationA New Function Space from Barron Class and Application to Neural Network Approximation