Approximate Newton Policy Gradient Algorithms (Q6074547)

From MaRDI portal

Jump to:navigation, search

!

WARNING

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use the normal view instead:

Approximate Newton Policy Gradient Algorithms

scientific article; zbMATH DE number 7749379

Language	Label	Description	Also known as
default for all languages	No label defined
English	Approximate Newton Policy Gradient Algorithms	scientific article; zbMATH DE number 7749379

Statements

scholarly article

0 references

Approximate Newton Policy Gradient Algorithms (English)

0 references

0 references

0 references

0 references

0 references

Inderjit S. Dhillon

0 references

SIAM Journal on Scientific Computing

0 references

publication date

12 October 2023

0 references

full work available at URL

https://arxiv.org/abs/2110.02398

0 references

zbMATH Keywords

policy gradient algorithm

0 references

approximate Newton method

0 references

quadratic convergence

0 references

Markov decision process

0 references

entropy regularization

0 references

reinforcement learning

0 references

MaRDI profile type

MaRDI publication profile

0 references

0 references

0 references

0 references

Fast global convergence of natural policy gradient methods with entropy regularization

0 references

A comparison of iterative methods for solving nonsymmetric linear systems

0 references

A Characterization of Superlinear Convergence and Its Application to Quasi-Newton Methods

0 references

0 references

OnActor-Critic Algorithms

0 references

Softmax policy gradient methods can take exponential time to converge

0 references

0 references

0 references

Primal-dual subgradient methods for convex problems

0 references

The Information Geometry of Mirror Descent

0 references

0 references

New results on superlinear convergence of classical quasi-Newton methods

0 references

Rates of superlinear convergence for classical quasi-Newton methods

0 references

0 references

Reinforcement learning. An introduction

0 references

Bi-CGSTAB: A Fast and Smoothly Converging Variant of Bi-CG for the Solution of Nonsymmetric Linear Systems

0 references

Hessian informed mirror descent

0 references

Simple statistical gradient-following algorithms for connectionist reinforcement learning

0 references

Mirror descent algorithms for minimizing interacting free energy

0 references

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence

0 references

Recommended article

Approximate Newton methods for policy search in Markov decision processes

Similarity Score

0.835472047328949

Recommender Run

Recommender Run 4

0 references

Fast global convergence of natural policy gradient methods with entropy regularization

Similarity Score

0.830440104007721

Recommender Run

Recommender Run 4

0 references

On linear and super-linear convergence of natural policy gradient algorithm

Similarity Score

0.774370551109314

Recommender Run

Recommender Run 4

0 references

Entropy Regularization for Mean Field Games with Learning

Similarity Score

0.74680495262146

Recommender Run

Recommender Run 4

0 references

Natural actor-critic algorithms

Similarity Score

0.7461656332015991

Recommender Run

Recommender Run 4

0 references

Identifiers

zbMATH Open document ID

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

zbMATH DE Number

0 references

10.1137/22M1492088

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6074547

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q6074547&oldid=58665717"