Long-Term Reward Prediction in TD Models of the Dopamine System

From MaRDI portal

Revision as of 03:14, 7 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:4409377

Jump to:navigation, search

DOI10.1162/089976602760407973zbMath1021.92005WikidataQ40621633 ScholiaQ40621633MaRDI QIDQ4409377

Nathaniel D. Daw, David S. Touretzky

Publication date: 22 October 2003

Published in: Neural Computation (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1162/089976602760407973

Mathematics Subject Classification ID

92C20: Neural biology

Related Items

Representation and Timing in Theories of the Dopamine System, Computational algorithms and neuronal network models underlying decision processes, Neural systems implicated in delayed and probabilistic reinforcement, Reinforcement learning in the brain, Multiple model-based reinforcement learning explains dopamine neuronal activity, Internal-Time Temporal Difference Model for Neural Value-Based Decision Making, A Neurocomputational Model for Cocaine Addiction, The Actor-Critic Learning Is Behind the Matching Law: Matching Versus Optimal Behaviors, Hyperbolically Discounted Temporal Difference Learning

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4409377&oldid=18435761"