Algorithms for Reinforcement Learning
DOI10.2200/S00268ED1V01Y201005AIM009zbMath1205.68320OpenAlexW4211221179MaRDI QIDQ3588852
Publication date: 10 September 2010
Published in: Synthesis Lectures on Artificial Intelligence and Machine Learning (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.2200/s00268ed1v01y201005aim009
simulationMarkov decision processesstochastic approximationonline learningreinforcement learningplanningactive learningtemporal difference learningfunction approximationleast-squares methodsQ-learningPAC-learningnatural gradientbias-variance tradeoffoverfittingstochastic gradient methodspolicy gradientactor-critic methods
Nonnumerical algorithms (68W05) Learning and adaptive systems in artificial intelligence (68T05) Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to computer science (68-02)
Related Items (41)
Uses Software
This page was built for publication: Algorithms for Reinforcement Learning