Asymptotic bias of stochastic gradient search (Q1704136): Difference between revisions

From MaRDI portal
Properties added (all with normal rank):
Property / MaRDI profile type: MaRDI publication profile
Property / arXiv ID: 1709.00291
Property / cites work: Q3324260
Property / cites work: Q4533362
Property / cites work: A Dynamical System Approach to Stochastic Approximations
Property / cites work: Q4938927
Property / cites work: Stochastic Approximations and Differential Inclusions
Property / cites work: Perturbations of set-valued dynamical systems, with applications to game theory
Property / cites work: Q3997575
Property / cites work: Q4257216
Property / cites work: Gradient Convergence in Gradient methods with Errors
Property / cites work: Semianalytic and subanalytic sets
Property / cites work: Stochastic approximation. A dynamical systems viewpoint.
Property / cites work: The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
Property / cites work: Inference in hidden Markov models.
Property / cites work: Stochastic approximation and its applications
Property / cites work: Robustness analysis for stochastic approximation algorithms
Property / cites work: Convergence and robustness of the Robbins-Monro algorithm truncated at randomly varying bounds
Property / cites work: Q2871232
Property / cites work: Chain recurrence, semiflows, and gradients
Property / cites work: Q2771497
Property / cites work: On Actor-Critic Algorithms
Property / cites work: On gradients of functions definable in o-minimal structures
Property / cites work: Q4421713
Property / cites work: Sur le problème de la division
Property / cites work: On semi- and subanalytic geometry
Property / cites work: Applications of a Kushner and Clark lemma to general classes of stochastic algorithms
Property / cites work: Markov Chains and Stochastic Stability
Property / cites work: Q4335417
Property / cites work: Approximate Dynamic Programming
Property / cites work: Particle approximations of the score and observed information matrix in state space models with application to parameter estimation
Property / cites work: Introduction to Stochastic Search and Optimization
Property / cites work: Analyticity, Convergence, and Convergence Rate of Recursive Maximum-Likelihood Estimation in Hidden Markov Models
Property / cites work: The geometry of critical and near-critical values of differentiable mappings
Property / OpenAlex ID: W2591423585

Latest revision as of 09:07, 30 July 2024

Language: English
Label: Asymptotic bias of stochastic gradient search
Description: scientific article

    Statements

    Asymptotic bias of stochastic gradient search (English)
    8 March 2018
    The paper investigates the asymptotic behavior of biased stochastic gradient search. The following algorithm is analyzed: \(\theta_{n+1} = \theta_{n} - \alpha_{n}(\nabla f(\theta_{n}) + \xi_{n})\), \(n \geq 0\). Under a set of assumptions on the step-size sequence \(\{\alpha_{n}\}\), the noise \(\{\xi_{n}\}\) and the objective function \(f\), the iterates are proved to converge to a neighborhood of the set of minima, and upper bounds on the radius of this neighborhood are obtained. The results are local: they hold only when the algorithm is stable. The proofs rely on chain recurrence, the Yomdin theorem and the Lojasiewicz inequalities. Further, the results are applied to stochastic gradient algorithms with Markovian dynamics and to the asymptotic analysis of a policy-gradient search algorithm for average-cost Markov decision problems. Global versions of the results are presented in Appendices A and B of the paper. The authors state that an extended version of the article is available at \url{arXiv:1709.00291}.
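The recursion in the review can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes a scalar quadratic objective f(theta) = theta^2 / 2, a noise term made of a constant bias plus zero-mean Gaussian noise, and step sizes alpha_n = (n + 1)^(-0.75). The function name `biased_sgd` and all parameter values are hypothetical choices for illustration only.

```python
import random


def biased_sgd(theta0, bias, noise_std, n_steps, seed=0):
    """Run theta_{n+1} = theta_n - alpha_n * (grad f(theta_n) + xi_n).

    Assumptions (illustrative, not from the paper):
      f(theta) = 0.5 * theta**2, so grad f(theta) = theta;
      xi_n = bias + Gaussian noise with standard deviation noise_std;
      alpha_n = (n + 1) ** -0.75.
    """
    rng = random.Random(seed)
    theta = theta0
    for n in range(n_steps):
        alpha = 1.0 / (n + 1) ** 0.75           # decreasing step size
        xi = bias + rng.gauss(0.0, noise_std)   # biased gradient error
        grad = theta                            # gradient of the toy objective
        theta = theta - alpha * (grad + xi)
    return theta


final = biased_sgd(theta0=5.0, bias=0.1, noise_std=1.0, n_steps=200_000)
unbiased = biased_sgd(theta0=5.0, bias=0.0, noise_std=1.0, n_steps=200_000)
print("distance to minimum with bias:   ", abs(final))
print("distance to minimum without bias:", abs(unbiased))
```

In this toy case the mean recursion has its fixed point at theta = -bias, so the iterates settle at a distance of roughly |bias| from the true minimum at 0 rather than at the minimum itself. This is the kind of neighborhood whose radius the paper bounds under far more general assumptions.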
    stochastic gradient search
    biased gradient estimation
    chain-recurrence
    Yomdin theorem
    Lojasiewicz inequalities
