Smoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpoint (Q2242923): Difference between revisions

From MaRDI portal

Revision as of 07:25, 5 March 2024

scientific article

Language	Label	Description	Also known as
English	Smoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpoint	scientific article

Statements

scholarly article

0 references

Smoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpoint (English)

0 references

10.1016/j.sysconle.2021.104988

0 references

0 references

L. A. Prashanth

0 references

Systems & Control Letters

0 references

publication date

10 November 2021

0 references

full work available at URL

https://arxiv.org/abs/2101.02137

0 references

Mathematics Subject Classification ID

0 references

zbMATH DE Number

0 references

zbMATH Keywords

off-policy

0 references

reinforcement learning

0 references

smoothed functional

0 references

gradient estimation

0 references

MaRDI profile type

MaRDI publication profile

0 references

Sitelinks

Mathematics (1 entry)

mardi Publication:2242923
