Feature-based methods for large scale dynamic programming

DOI10.1007/BF00114724MaRDI QIDQ1911341zbMATH OpenFDO

Authors John N. Tsitsiklis, Benjamin Van Roy

Publication date 21 April 1996

Published in Machine Learning (Search for Journal in Brave)

Full work available at URL https://doi.org/10.1007/BF00114724

zbMATH Keywords

dynamic programming

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Recommendations

Cites work

Cited in

(19)

The actor-critic algorithm as multi-time-scale stochastic approximation.
Approximate dynamic programming for stochastic \(N\)-stage optimization with application to optimal consumption under uncertainty
Shape constraints in economics and operations research
Feature-based methods for large scale dynamic programming
Dynamic programming approximation algorithms for the capacitated lot-sizing problem
Automatic induction of Bellman-error features for probabilistic planning
Single sample path-based optimization of Markov chains
Offline reinforcement learning in large state spaces: algorithms and guarantees
Tetris: A study of randomized constraint sampling
Data-driven models for capacity allocation of inpatient beds in a Chinese public hospital
Refined performance estimation for the l-step lookahead policy in reinforcement learning
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
Approximate policy iteration: a survey and some new methods
Error analysis for approximate CVaR-optimal control with a maximum cost
Reinforcement learning in non-Markovian environments
Regularization and two time scales for convergence of reinforcement learning
Reinforcement learning in convergently non-stationary environments: feudal hierarchies and learned representations
The Benefits of State Aggregation with Extreme-Point Weighting for Assemble-to-Order Systems
Solving the k-sparse eigenvalue problem with reinforcement learning

This page was built for publication: Feature-based methods for large scale dynamic programming

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1911341)