A Small Gain Analysis of Single Timescale Actor Critic

From MaRDI portal
Publication:6042800

DOI10.1137/22M1483335arXiv2203.02591OpenAlexW4367311942MaRDI QIDQ6042800FDOQ6042800


Authors: Alex Olshevsky, Bahman Gharesifard Edit this on Wikidata


Publication date: 4 May 2023

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Abstract: We consider a version of actor-critic which uses proportional step-sizes and only one critic update with a single sample from the stationary distribution per actor step. We provide an analysis of this method using the small-gain theorem. Specifically, we prove that this method can be used to find a stationary point, and that the resulting sample complexity improves the state of the art for actor-critic methods to Oleft(mu2epsilon2ight) to find an epsilon-approximate stationary point where mu is the condition number associated with the critic.


Full work available at URL: https://arxiv.org/abs/2203.02591




Recommendations




Cites Work


Cited In (1)





This page was built for publication: A Small Gain Analysis of Single Timescale Actor Critic

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6042800)