Search results
From MaRDI portal
- reward temporal-difference learning 2002-07-08 Paper On the existence of fixed points for approximate value iteration and temporal-difference learning 2001-02-19...10 bytes (19 words) - 18:58, 11 December 2023
- on-policy reinforcement-learning algorithms 2000-06-21 Paper Analytical mean squared error curves for temporal difference learning 1998-09-07 Paper Reinforcement...10 bytes (18 words) - 12:28, 13 December 2023
- Bellman Equations and Temporal-Difference Learning 2020-08-05 Paper On Generalized Bellman Equations and Temporal-Difference Learning 2018-11-21 Paper https://portal...10 bytes (18 words) - 09:55, 7 October 2023
- mining. Self-learning techniques for recommendation engines 2013-08-07 Paper Multilevel Preconditioners for Temporal-Difference Learning Methods Related...10 bytes (16 words) - 17:50, 9 December 2023
- existence of fixed points for approximate value iteration and temporal-difference learning 2001-02-19 Paper Output feedback control of Markov jump linear...10 bytes (20 words) - 17:20, 13 December 2023
- methods 2021-04-20 Paper On Generalized Bellman Equations and Temporal-Difference Learning 2020-08-05 Paper https://portal.mardi4nfdi.de/entity/Q4626283...10 bytes (19 words) - 21:59, 22 September 2023
- Publication Date of Publication Type The application of temporal difference learning in optimal diet models 2019-05-14 Paper...10 bytes (16 words) - 14:13, 6 October 2023
- Publication Date of Publication Type A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation 2021-07-29 Paper...10 bytes (16 words) - 23:16, 27 December 2023
- Publication Date of Publication Type Off-policy temporal difference learning with distribution adaptation in fast mixing chains 2018-10-22 Paper...10 bytes (16 words) - 22:03, 24 September 2023
- Publication Date of Publication Type Off-policy temporal difference learning with distribution adaptation in fast mixing chains 2018-10-22 Paper...10 bytes (16 words) - 22:03, 24 September 2023
- Paper On weak learning 1996-04-29 Paper On the worst-case analysis of temporal-difference learning algorithms 1996-04-21 Paper Learning binary relations...10 bytes (19 words) - 06:04, 9 December 2023
- analysis of temporal-difference learning algorithms with constant step-sizes 2006-11-22 Paper Asymptotic analysis of temporal-difference learning algorithms...10 bytes (18 words) - 06:29, 7 October 2023
- de/entity/Q2768395 2003-07-29 Paper Technical update: Least-squares temporal difference learning 2002-07-08 Paper 10.1162/15324430152733124 2002-04-03 Paper...10 bytes (18 words) - 17:39, 24 September 2023
- Prefrontal Cortex 2019-06-04 Paper Hyperbolically Discounted Temporal Difference Learning 2010-06-11 Paper...10 bytes (18 words) - 19:44, 26 December 2023
- Publication Date of Publication Type A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation 2021-07-29 Paper On the tightness...10 bytes (16 words) - 19:14, 24 September 2023
- de/entity/Q5477859 2006-06-29 Paper Linear least-squares algorithms for temporal difference learning 1996-06-10 Paper...10 bytes (18 words) - 13:54, 24 September 2023
- Publication Date of Publication Type Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and...10 bytes (16 words) - 00:04, 25 September 2023
- de/entity/Q4240812 1999-07-05 Paper Analytical mean squared error curves for temporal difference learning 1998-09-07 Paper Recognition in Hierarchical Models 1997-08-07...10 bytes (17 words) - 20:23, 12 December 2023
- 2015-08-10 Paper Evolving small-board Go players using coevolutionary temporal difference learning with archives 2014-03-26 Paper...10 bytes (16 words) - 03:08, 25 September 2023
- Prefrontal Cortex 2019-06-04 Paper Hyperbolically Discounted Temporal Difference Learning 2010-06-11 Paper How laminar frontal cortex and basal ganglia...10 bytes (18 words) - 22:17, 24 September 2023