TD-regularized actor-critic methods (Q2320580)

scientific article; zbMATH DE number 7097478

Language	Label	Description	Also known as
default for all languages	No label defined
English	TD-regularized actor-critic methods	scientific article; zbMATH DE number 7097478

Statements

instance of

scholarly article

title

TD-regularized actor-critic methods (English)

Mohammad Emtiyaz Khan

published in

Machine Learning

publication date

23 August 2019

full work available at URL

https://arxiv.org/abs/1812.08288

zbMATH Keywords

reinforcement learning

actor-critic

temporal difference

MaRDI publication profile

cites work

Q4558153

Q4821526

Variance reduction techniques for gradient estimates in reinforcement learning

On Actor-Critic Algorithms

\(\text{Q}(\lambda)\) with off-policy corrections

Q5491447

Reinforcement learning. An introduction

Simple statistical gradient-following algorithms for connectionist reinforcement learning

Identifiers

zbMATH Open document ID

1493.68313

10.1007/S10994-019-05788-0

Sitelinks

Mathematics (1 entry)

mardi Publication:2320580