The theory of dynamic programming

DOI10.1090/S0002-9904-1954-09848-8zbMath0057.12503OpenAlexW1980516134WikidataQ56115923 ScholiaQ56115923MaRDI QIDQ5830128

Richard Bellman

Publication date: 1954

Published in: Bulletin of the American Mathematical Society (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1090/s0002-9904-1954-09848-8

Mathematics Subject Classification ID

Dynamic programming (90C39)

Related Items

Time Consistency of the Mean-Risk Problem, On optimal segmentation and parameter tuning for multiple change-point detection and inference, Multiscale Quantile Segmentation, Canonical forms for stochastic nonlinear systems, Towards sustainable timber harvesting of homogeneous stands: dynamic programming in synergy with forest growth simulation, Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming, Temporal concatenation for Markov decision processes, Computational approaches for mixed integer optimal control problems with indicator constraints, Dynamic optimization and its relation to classical and quantum constrained systems, Second class constraints and the consistency of optimal control theory in phase space, An adaptive multi-spline refinement algorithm in simulation based sailboat trajectory optimization using onboard multi-core computer systems, New solution procedures for the order picker routing problem in U-shaped Pick areas with a movable depot, The quantum dark side of the optimal control theory, Model predictive control of cash balance in a cash concentration and disbursements system, Approximate dynamic programming with post-decision states as a solution method for dynamic economic models, ON THE INTERFACE BETWEEN OPTIMAL PERIODIC AND CONTINUOUS DIVIDEND STRATEGIES IN THE PRESENCE OF TRANSACTION COSTS, Stochastic decision diagrams, Initialization of the shooting method via the Hamilton-Jacobi-Bellman approach, On continuous-time infinite horizon optimal control -- dissipativity, stability, and transversality, A dual subspace parsimonious mixture of matrix normal distributions, What Next?, Multibody dynamics and control using machine learning, Cluster-based lateral transshipments for the Zambian health supply chain, The policy graph decomposition of multistage stochastic programming problems, Risk-averse optimization of reward-based coherent risk measures, Solving nonlinear and dynamic programming equations on extended \(b\)-metric spaces with the fixed-point technique, Hiring Secretaries over Time: The Benefit of Concurrent Employment, Dissipativity in infinite horizon optimal control and dynamic programming, Synthesis of distributed optimal control in the tracking problem for the optimization of thermal processes described by integro-differential equations, A duality-based proof of the triangle inequality for the Wasserstein distances, On the sample complexity of actor-critic method for reinforcement learning with function approximation, A survey of average cost problems in deterministic discrete-time control systems, Optimization of an economic ordering quantity model for non-instantaneous deteriorating items with ordering time constraint using dynamic programming, Cost-aware sequential diagnostics, From Reinforcement Learning to Deep Reinforcement Learning: An Overview, Dominance rules in combinatorial optimization problems, Maximizing reachability in a temporal graph obtained by assigning starting times to a collection of walks, Analysis and Numerical Approximation of Stationary Second-Order Mean Field Game Partial Differential Inclusions, On using dynamic programming for time warping in pattern recognition, Numerical solutions to continuous linear programming problems, Strengthening of the Kneser theorem on zeros of the solutions of the equation \(y+ p(x)y=0\), A unified framework for stochastic optimization, Reinforcement learning for long-run average cost., Some problems of the theory of dynamic programming, Dynamic Programming for an Optimal and Equitable Public Load Shedding Schedule, Unnamed Item, Cooperative and non-cooperative behaviour in the exploitation of a common renewable resource with environmental stochasticity, Quantitative model-checking of controlled discrete-time Markov processes, Branch-and-bound algorithms: a survey of recent advances in searching, branching, and pruning, A population-based fast algorithm for a billion-dimensional resource allocation problem with integer variables, A linear-quadratic Gaussian approach to dynamic information acquisition, Existence of solutions to the Cauchy problem and stability of kink- solutions of the nonlinear Schrödinger equation, Application of dynamical systems to the study of asymptotic properties of solutions to nonlinear higher-order differential equations, Time to the MRCA of a sample in a Wright-Fisher model with variable population size, Segmentation of choroidal boundary in enhanced depth imaging octs using a multiresolution texture based modeling in graph cuts, A boundary value problem for differential equations with a retarded argument, Variational optimisation by the solution of a series of Hamilton-Jacobi equations, Oscillatory properties of second order nonlinear differential equations, Suzdal conference--2. Proceedings of the international conference on dynamical systems and differential equations, Suzdal, Russia, July 1--6, 2002. Part 2. Transl. from the Russian, Optimisation model for multi-item multi-echelon supply chains with nested multi-level products, A rotating-grid upwind fast sweeping scheme for a class of Hamilton-Jacobi equations, Stochastic dynamic programming illuminates the link between environment, physiology, and evolution, Relationship between solutions of families of two-point boundary value problems and Cauchy problems, Feedback control problem of an SIR epidemic model based on the Hamilton-Jacobi-Bellman equation, Functional data clustering by projection into latent generalized hyperbolic subspaces, High-performance simulation-based algorithms for an alpine ski racer's trajectory optimization in heterogeneous computer systems, Loading tow trains ergonomically for just-in-time part supply, A breakpoint detection in the mean model with heterogeneous variance on fixed time intervals, Computational aspects of optimal strategic network diffusion, Some Functional Equations in the Theory of Dynamic Programming. I. Functions of Points and Point Transformations, Stability in a neighborhood of a certain state, On the Bogolyubov-Mitropol'skij averaging principle for a class of second order hyperbolic equations, A theorem on averaging for hyperbolic systems of first order, The reduction principle in the Banach space, Variational problems with constraints, Functional equations in the theory of dynamic programming. III, The theory of dynamic programming, Eigenvalues and Functional Equations, The truncation of a countable system of partial differential equations, Deep reinforcement learning for inventory control: a roadmap, Ratcheting with a bliss level of consumption, The difference and unity of irregular LQ control and standard LQ control and its solution, Asymptotic series for the solution to the Cauchy problem, A scheduling problem in the baking industry, Political behavior of a deputy and voting prediction in a legislative body, A Linear Programming Approach to Sequential Hypothesis Testing, A new look at Bellman's principle of optimality, Qualitative analysis of families of bounded solutions of the nonlinear three-dimensional Schrödinger equation, Model-based Reinforcement Learning: A Survey, The maximum principle, Bellman's equation, and Carathéodory's work, High-order fully actuated system approaches: Part I. Models and basic procedure, Improving the filtering of branch-and-bound MDD solver

Cites Work