Pages that link to "Item:Q5391735"
From MaRDI portal
The following pages link to Neuro-Dynamic Programming: An Overview and Recent Results (Q5391735):
Displayed 30 items.
- Parameter-free sampled fictitious play for solving deterministic dynamic programming problems (Q289136) (← links)
- Approximate linear programming for networks: average cost bounds (Q342031) (← links)
- Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731) (← links)
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model (Q399890) (← links)
- Random exploration of the procedural space for single-view 3D modeling of buildings (Q408934) (← links)
- Optimization of a pumped-storage fixed-head hydroplant: the bang-singular-bang solution (Q410438) (← links)
- Dynamic admission and service rate control of a queue (Q600905) (← links)
- A survey of motion planning algorithms from the perspective of autonomous UAV guidance (Q614803) (← links)
- Strategy optimization for controlled Markov process with descriptive complexity constraint (Q848403) (← links)
- Approximate dynamic programming and its applications to the design of Phase I cancer trials (Q903295) (← links)
- Assessment of the Cell Broadband Engine Architecture as a platform to solve closed-loop optimal control problems (Q991095) (← links)
- Optimal stopping with a probabilistic constraint (Q1695821) (← links)
- A stochastic control formalism for dynamic biologically conformal radiation therapy (Q1926672) (← links)
- Stochastic optimization for real time service capacity allocation under random service demand (Q1931638) (← links)
- Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains (Q1959632) (← links)
- Coding and control for communication networks (Q2269497) (← links)
- Average-case performance of rollout algorithms for knapsack problems (Q2349849) (← links)
- Dynamic control in multi-item production/inventory systems (Q2362241) (← links)
- On sample size control in sample average approximations for solving smooth stochastic programs (Q2376122) (← links)
- Solution for a class of closed-loop leader-follower games with convexity conditions on the payoffs (Q2399326) (← links)
- Statistical analysis of trajectories on Riemannian manifolds: bird migration, hurricane tracking and video surveillance (Q2453688) (← links)
- Approximate policy iteration: a survey and some new methods (Q2887629) (← links)
- A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) (← links)
- A Retrograde Approximation Algorithm for Multi-player Can’t Stop (Q3601837) (← links)
- An Efficient Gradient Projection Method for Stochastic Optimal Control Problems (Q4596726) (← links)
- Adaptive Simulation Selection for the Discovery of the Ground State Line of Binary Alloys with a Limited Computational Budget (Q4604867) (← links)
- An Open-Loop Approach for a Stochastic Production Planning Problem with Remanufacturing Process (Q5245451) (← links)
- Approximate dynamic programming via iterated Bellman inequalities (Q5256802) (← links)
- Data-Efficient Quickest Change Detection with On–Off Observation Control (Q5389555) (← links)
- Quadratic approximate dynamic programming for input‐affine systems (Q5409145) (← links)