Pages that link to "Item:Q1812928"
From MaRDI portal
The following pages link to Simple statistical gradient-following algorithms for connectionist reinforcement learning (Q1812928):
Displaying 8 items.
- Policy search for motor primitives in robotics (Q413874) (← links)
- Analysis and improvement of policy gradient estimation (Q448295) (← links)
- Set-to-Sequence Methods in Machine Learning: A Review (Q5154747) (← links)
- Dynamic Neural Turing Machine with Continuous and Discrete Addressing Schemes (Q5157151) (← links)
- A Reward-Maximizing Spiking Neuron as a Bounded Rational Decision Maker (Q5380302) (← links)
- An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403) (← links)
- STDP-Compatible Approximation of Backpropagation in an Energy-Based Model (Q5380662) (← links)
- Nonconvex Policy Search Using Variational Inequalities (Q5380851) (← links)