Asymptotic bias of stochastic gradient search (Q1704136)

scientific article

    Statements

    Asymptotic bias of stochastic gradient search (English)
    8 March 2018
    The paper investigates the asymptotic behavior of biased stochastic gradient search. The algorithm under analysis is \(\theta_{n+1} = \theta_n - \alpha_n(\nabla f(\theta_n) + \xi_n)\), \(n \geq 0\), where \(\{\alpha_n\}\) is the step-size sequence, \(\{\xi_n\}\) is the noise in the gradient estimates, and \(f\) is the objective function. Under a set of assumptions on the step sizes, the noise and \(f\), the iterates are shown to converge to a neighborhood of the set of minima of \(f\), and upper bounds on the radius of this neighborhood are obtained. The results are local: they hold only when the algorithm is stable. The proofs rely on chain-recurrence, the Yomdin theorem and Łojasiewicz inequalities. The results are then applied to stochastic gradient algorithms with Markovian dynamics and to the asymptotic analysis of a policy-gradient search algorithm for average-cost Markov decision problems. Global versions of the results are presented in Appendices A and B of the paper. An extended version of this article is available at \url{arXiv:1709.00291}.
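    To make the recursion concrete, here is a minimal Python sketch of biased stochastic gradient search on a toy quadratic objective \(f(\theta) = \frac{1}{2}\|\theta\|^2\), with step sizes \(\alpha_n = 1/(n+1)\) and Gaussian noise carrying a constant bias \(b\). The objective, step-size schedule and bias are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def biased_sgd(grad_f, theta0, bias, n_iters=10_000, noise_std=0.1, seed=0):
    """Run theta_{n+1} = theta_n - alpha_n * (grad_f(theta_n) + xi_n),
    where xi_n = bias + Gaussian noise, so the gradient estimates are
    persistently biased and the iterates settle in a neighborhood of the
    set of minima rather than at a minimizer itself."""
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float)
    for n in range(n_iters):
        alpha = 1.0 / (n + 1)  # decreasing step sizes (Robbins-Monro type)
        xi = bias + noise_std * rng.standard_normal(theta.shape)
        theta = theta - alpha * (grad_f(theta) + xi)
    return theta

# Toy example: f(theta) = 0.5 * ||theta||^2, so grad f(theta) = theta
# and the unique minimizer is the origin.
grad_f = lambda theta: theta
final = biased_sgd(grad_f, theta0=[5.0, -3.0], bias=np.array([0.05, 0.0]))
print(final)  # near [-0.05, 0.0]: offset from the origin by the bias
```

    Because the gradient estimates carry the constant bias \(b\), the iterates converge to a point near \(\theta = -b\) instead of the minimizer \(\theta = 0\); the distance \(\|b\|\) plays the role of the neighborhood radius bounded in the paper.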
    stochastic gradient search
    biased gradient estimation
    chain-recurrence
    Yomdin theorem
    Łojasiewicz inequalities
