Pages that link to "Item:Q5391735"

From MaRDI portal

← Neuro-Dynamic Programming: An Overview and Recent Results (Q5391735)

Jump to:navigation, search

The following pages link to Neuro-Dynamic Programming: An Overview and Recent Results (Q5391735):

Displayed 30 items.

Parameter-free sampled fictitious play for solving deterministic dynamic programming problems (Q289136) ‎ (← links)
Approximate linear programming for networks: average cost bounds (Q342031) ‎ (← links)
Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731) ‎ (← links)
Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model (Q399890) ‎ (← links)
Random exploration of the procedural space for single-view 3D modeling of buildings (Q408934) ‎ (← links)
Optimization of a pumped-storage fixed-head hydroplant: the bang-singular-bang solution (Q410438) ‎ (← links)
Dynamic admission and service rate control of a queue (Q600905) ‎ (← links)
A survey of motion planning algorithms from the perspective of autonomous UAV guidance (Q614803) ‎ (← links)
Strategy optimization for controlled Markov process with descriptive complexity constraint (Q848403) ‎ (← links)
Approximate dynamic programming and its applications to the design of Phase I cancer trials (Q903295) ‎ (← links)
Assessment of the Cell Broadband Engine Architecture as a platform to solve closed-loop optimal control problems (Q991095) ‎ (← links)
Optimal stopping with a probabilistic constraint (Q1695821) ‎ (← links)
A stochastic control formalism for dynamic biologically conformal radiation therapy (Q1926672) ‎ (← links)
Stochastic optimization for real time service capacity allocation under random service demand (Q1931638) ‎ (← links)
Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains (Q1959632) ‎ (← links)
Coding and control for communication networks (Q2269497) ‎ (← links)
Average-case performance of rollout algorithms for knapsack problems (Q2349849) ‎ (← links)
Dynamic control in multi-item production/inventory systems (Q2362241) ‎ (← links)
On sample size control in sample average approximations for solving smooth stochastic programs (Q2376122) ‎ (← links)
Solution for a class of closed-loop leader-follower games with convexity conditions on the payoffs (Q2399326) ‎ (← links)
Statistical analysis of trajectories on Riemannian manifolds: bird migration, hurricane tracking and video surveillance (Q2453688) ‎ (← links)
Approximate policy iteration: a survey and some new methods (Q2887629) ‎ (← links)
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) ‎ (← links)
A Retrograde Approximation Algorithm for Multi-player Can’t Stop (Q3601837) ‎ (← links)
An Efficient Gradient Projection Method for Stochastic Optimal Control Problems (Q4596726) ‎ (← links)
Adaptive Simulation Selection for the Discovery of the Ground State Line of Binary Alloys with a Limited Computational Budget (Q4604867) ‎ (← links)
An Open-Loop Approach for a Stochastic Production Planning Problem with Remanufacturing Process (Q5245451) ‎ (← links)
Approximate dynamic programming via iterated Bellman inequalities (Q5256802) ‎ (← links)
Data-Efficient Quickest Change Detection with On–Off Observation Control (Q5389555) ‎ (← links)
Quadratic approximate dynamic programming for input‐affine systems (Q5409145) ‎ (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q5391735"