Reinforcement Learning: A Tutorial Survey and Recent Advances
From MaRDI portal
Publication:2901057
DOI10.1287/ijoc.1080.0305zbMath1243.68240OpenAlexW2095487261MaRDI QIDQ2901057
Publication date: 28 July 2012
Published in: INFORMS Journal on Computing (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/ijoc.1080.0305
Related Items (17)
A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning ⋮ Actor-Critic–Like Stochastic Adaptive Search for Continuous Simulation Optimization ⋮ Dynamic capacity planning using strategic slack valuation ⋮ Tabu search guided by reinforcement learning for the max-mean dispersion problem ⋮ Approximate stochastic annealing for online control of infinite horizon Markov decision processes ⋮ A reinforcement learning approach to convoy scheduling on a contested transportation network ⋮ Literature reviews in operations research: a new taxonomy and a meta review ⋮ A self‐adaptive SAC‐PID control approach based on reinforcement learning for mobile robots ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ Policy sharing between multiple mobile robots using decision trees ⋮ Q-learning-based target selection for bearings-only autonomous navigation ⋮ Approximate Dynamic Programming based on High Dimensional Model Representation ⋮ Continuous Action Generation of Q‐Learning in Multi‐Agent Cooperation ⋮ A critical review of the most popular types of neuro control ⋮ Scalable estimation strategies based on stochastic approximations: classical results and new insights ⋮ On Incomplete Learning and Certainty-Equivalence Control ⋮ An aggregation-based approximate dynamic programming approach for the periodic review model with random yield
This page was built for publication: Reinforcement Learning: A Tutorial Survey and Recent Advances