Neuro-Dynamic Programming: An Overview and Recent Results

From MaRDI portal
Publication:5391735


DOI10.1007/978-3-540-69995-8_11zbMath1209.90343MaRDI QIDQ5391735

Dimitri P. Bertsekas

Publication date: 7 April 2011

Published in: Operations Research Proceedings (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/978-3-540-69995-8_11


90C39: Dynamic programming

68T37: Reasoning under uncertainty in the context of artificial intelligence


Related Items

An Efficient Gradient Projection Method for Stochastic Optimal Control Problems, Adaptive Simulation Selection for the Discovery of the Ground State Line of Binary Alloys with a Limited Computational Budget, An Open-Loop Approach for a Stochastic Production Planning Problem with Remanufacturing Process, Approximate dynamic programming via iterated Bellman inequalities, Data-Efficient Quickest Change Detection with On–Off Observation Control, Quadratic approximate dynamic programming for input‐affine systems, Parameter-free sampled fictitious play for solving deterministic dynamic programming problems, Approximate linear programming for networks: average cost bounds, Q-learning and policy iteration algorithms for stochastic shortest path problems, Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model, Random exploration of the procedural space for single-view 3D modeling of buildings, Optimization of a pumped-storage fixed-head hydroplant: the bang-singular-bang solution, Dynamic admission and service rate control of a queue, A survey of motion planning algorithms from the perspective of autonomous UAV guidance, Strategy optimization for controlled Markov process with descriptive complexity constraint, Approximate dynamic programming and its applications to the design of Phase I cancer trials, Assessment of the Cell Broadband Engine Architecture as a platform to solve closed-loop optimal control problems, Optimal stopping with a probabilistic constraint, A stochastic control formalism for dynamic biologically conformal radiation therapy, Stochastic optimization for real time service capacity allocation under random service demand, Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains, Coding and control for communication networks, Average-case performance of rollout algorithms for knapsack problems, Dynamic control in multi-item production/inventory systems, On sample size control in sample average approximations for solving smooth stochastic programs, Solution for a class of closed-loop leader-follower games with convexity conditions on the payoffs, Statistical analysis of trajectories on Riemannian manifolds: bird migration, hurricane tracking and video surveillance, Approximate policy iteration: a survey and some new methods, A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications, A Retrograde Approximation Algorithm for Multi-player Can’t Stop