Analysis and improvement of policy gradient estimation (Q448295)

scientific article; zbMATH DE number 6074433

Statements

instance of

scholarly article

0 references

title

Analysis and improvement of policy gradient estimation (English)

0 references

publication date

30 August 2012

0 references

zbMATH Keywords

reinforcement learning

0 references

policy gradients

0 references

policy gradients with parameter-based exploration

0 references

variance reduction

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1016/j.neunet.2011.09.005

0 references

cites work

Q4533363

0 references

Using Expectation-Maximization for Reinforcement Learning

0 references

Q4692508

0 references

Variance reduction techniques for gradient estimates in reinforcement learning

0 references

Q4427427

0 references

10.1162/1532443041827907

0 references

Q2769922

0 references

Approximate gradient methods in policy-space optimization of Markov reward processes

0 references

Simple statistical gradient-following algorithms for connectionist reinforcement learning

0 references

Identifiers

zbMATH Open document ID

1245.68165

0 references

Mathematics Subject Classification ID

0 references

DBLP publication ID

journals/nn/ZhaoHNS12

0 references

DOI

10.1016/J.NEUNET.2011.09.005

0 references

Sitelinks

Mathematics (1 entry)

mardi Analysis and improvement of policy gradient estimation