Long-Term Reward Prediction in TD Models of the Dopamine System
DOI: 10.1162/089976602760407973; zbMath: 1021.92005; Wikidata: Q40621633; Scholia: Q40621633; MaRDI QID: Q4409377
Nathaniel D. Daw, David S. Touretzky
Publication date: 22 October 2003
Published in: Neural Computation
Full work available at URL: https://doi.org/10.1162/089976602760407973
92C20: Neural biology
Related Items
Representation and Timing in Theories of the Dopamine System
Computational algorithms and neuronal network models underlying decision processes
Neural systems implicated in delayed and probabilistic reinforcement
Reinforcement learning in the brain
Multiple model-based reinforcement learning explains dopamine neuronal activity
Internal-Time Temporal Difference Model for Neural Value-Based Decision Making
A Neurocomputational Model for Cocaine Addiction
The Actor-Critic Learning Is Behind the Matching Law: Matching Versus Optimal Behaviors
Hyperbolically Discounted Temporal Difference Learning
Cites Work