Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

From MaRDI portal
Publication:3593959


DOI10.1162/neco.2007.19.6.1468zbMath1115.68473WikidataQ47841509 ScholiaQ47841509MaRDI QIDQ3593959

Răzvan V. Florian

Publication date: 6 August 2007

Published in: Neural Computation (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1162/neco.2007.19.6.1468


68W05: Nonnumerical algorithms

68T05: Learning and adaptive systems in artificial intelligence


Related Items

Supervised Learning in Spiking Neural Networks with ReSuMe: Sequence Learning, Classification, and Spike Shifting, Learning with Precise Spike Times: A New Decoding Algorithm for Liquid State Machines, Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses, A Spiking Neural Model for Stable Reinforcement of Synapses Based on Multiple Distal Rewards, A New Supervised Learning Algorithm for Spiking Neurons, Spike-Timing-Dependent Construction, A Reward-Maximizing Spiking Neuron as a Bounded Rational Decision Maker, Learning Spatiotemporally Encoded Pattern Transformations in Structured Spiking Neural Networks, Sailboat navigation control system based on spiking neural networks, Second-order information bottleneck based spiking neural networks for sEMG recognition, Short-term plasticity as cause-effect hypothesis testing in distal reward learning, Adaptive learning rate of SpikeProp based on weight convergence analysis, Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison, Phenomenological models of synaptic plasticity based on spike timing, Supervised learning with decision margins in pools of spiking neurons, Robust spike-train learning in spike-event based weight update, Robust learning in SpikeProp, A supervised multi-spike learning algorithm based on gradient descent for spiking neural networks, Statistical Mechanics of Reward-Modulated Learning in Decision-Making Networks, Learning Spike-Based Population Codes by Reward and Population Feedback, A Spiking Neural Network Model of an Actor-Critic Learning Agent, A Gradient Learning Rule for the Tempotron, On the Asymptotic Equivalence Between Differential Hebbian and Temporal Difference Learning



Cites Work