Expected policy gradients for reinforcement learning (Q4969098)

scientific article; zbMATH DE number 7255083

Language	Label	Description	Also known as
default for all languages	No label defined
English	Expected policy gradients for reinforcement learning	scientific article; zbMATH DE number 7255083

Statements

5 October 2020

full work available at URL

https://arxiv.org/abs/1801.03326

https://jmlr.csail.mit.edu/papers/v21/18-012.html

zbMATH Keywords

policy gradients

exploration

bounded actions

reinforcement learning

Markov decision process (MDP)

cites work

Q4558153

Natural actor-critic algorithms

Q4188569

Optimal Estimation of Dynamic Systems

Approximate Newton methods for policy search in Markov decision processes

Bayesian policy gradient and actor-critic algorithms

10.1162/1532443041827907

Multi-objective reinforcement learning through continuous Pareto manifold approximation

Policy gradient in Lipschitz Markov decision processes

Q4315289

Some Relations Between Extended and Unscented Kalman Filters

Reinforcement learning. An introduction

Identifiers

zbMATH Open document ID

1498.68229

Sitelinks

Mathematics (1 entry)

mardi Publication:4969098