Short-term plasticity as cause-effect hypothesis testing in distal reward learning
From MaRDI portal
Publication:309633
DOI10.1007/S00422-014-0628-0zbMATH Open1344.92041arXiv1402.0710OpenAlexW3098750840WikidataQ39134778 ScholiaQ39134778MaRDI QIDQ309633FDOQ309633
Authors: Andrea Soltoggio
Publication date: 7 September 2016
Published in: Biological Cybernetics (Search for Journal in Brave)
Abstract: Asynchrony, overlaps and delays in sensory-motor signals introduce ambiguity as to which stimuli, actions, and rewards are causally related. Only the repetition of reward episodes helps distinguish true cause-effect relationships from coincidental occurrences. In the model proposed here, a novel plasticity rule employs short and long-term changes to evaluate hypotheses on cause-effect relationships. Transient weights represent hypotheses that are consolidated in long-term memory only when they consistently predict or cause future rewards. The main objective of the model is to preserve existing network topologies when learning with ambiguous information flows. Learning is also improved by biasing the exploration of the stimulus-response space towards actions that in the past occurred before rewards. The model indicates under which conditions beliefs can be consolidated in long-term memory, it suggests a solution to the plasticity-stability dilemma, and proposes an interpretation of the role of short-term plasticity.
Full work available at URL: https://arxiv.org/abs/1402.0710
Recommendations
- Cerebral mechanism for reward-mediated learning: A mathematical model of neuropopulational network plasticity
- Phenomenological models of synaptic plasticity based on spike timing
- Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity
- A spiking neural model for stable reinforcement of synapses based on multiple distal rewards
- Learning to discriminate through long-term changes of dynamical synaptic transmission
distal rewardmemory consolidationoperant learningplasticity versus stabilityshort-term plasticitytransient weights
Cites Work
- Learning Bayesian networks: The combination of knowledge and statistical data
- Combining Hebbian and reinforcement learning in a minibrain model
- Eluding oblivion with smart stochastic selection of synaptic updates
- Learning Spike-Based Population Codes by Reward and Population Feedback
- Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity
- Hebbian learning, its correlation catastrophe, and unlearning
- A spiking neural model for stable reinforcement of synapses based on multiple distal rewards
- Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
- Learning Only When Necessary: Better Memories of Correlated Patterns in Networks with Bounded Synapses
Cited In (1)
This page was built for publication: Short-term plasticity as cause-effect hypothesis testing in distal reward learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q309633)