Short-term plasticity as cause-effect hypothesis testing in distal reward learning

DOI10.1007/S00422-014-0628-0MaRDI QIDQ309633zbMATH OpenOpenAlexWikidataFDO

Authors Andrea Soltoggio

Publication date 7 September 2016

Published in Biological Cybernetics (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1402.0710

zbMATH Keywords

distal reward memory consolidation operant learning plasticity versus stability short-term plasticity transient weights

Mathematics Subject Classification ID

Neural biology (92C20)

Abstract: Asynchrony, overlaps and delays in sensory-motor signals introduce ambiguity as to which stimuli, actions, and rewards are causally related. Only the repetition of reward episodes helps distinguish true cause-effect relationships from coincidental occurrences. In the model proposed here, a novel plasticity rule employs short and long-term changes to evaluate hypotheses on cause-effect relationships. Transient weights represent hypotheses that are consolidated in long-term memory only when they consistently predict or cause future rewards. The main objective of the model is to preserve existing network topologies when learning with ambiguous information flows. Learning is also improved by biasing the exploration of the stimulus-response space towards actions that in the past occurred before rewards. The model indicates under which conditions beliefs can be consolidated in long-term memory, it suggests a solution to the plasticity-stability dilemma, and proposes an interpretation of the role of short-term plasticity.

Recommendations

Cites work

Cited in

(1)

Corticostriatal synaptic weight evolution in a two-alternative forced choice task: a computational study

This page was built for publication: Short-term plasticity as cause-effect hypothesis testing in distal reward learning

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q309633)