Do stochastic algorithms avoid traps?

From MaRDI portal

Publication:1917693

zbMath: 0849.62043
MaRDI QID: Q1917693

Marie Duflo, Odile Brandière

Publication date: 27 October 1996

Published in: Annales de l'Institut Henri Poincaré. Probabilités et Statistiques

Full work available at URL: http://www.numdam.org/item?id=AIHPB_1996__32_3_395_0




Related Items (29)

When can the two-armed bandit algorithm be trusted?
Sur quelques algorithmes récursifs pour les probabilités numériques
Vertex-reinforced random walks and a conjecture of Pemantle
Convergence rates for annealing diffusion processes
A space quantization method for numerical integration
Risk-Sensitive Reinforcement Learning via Policy Gradient Search
Heterogeneous beliefs and local information in stochastic fictitious play
On maximum a posteriori estimation with Plug & Play priors and stochastic gradient descent
Stochastic heavy ball
Widening the scope of an eigenvector stochastic approximation process and application to streaming PCA and related methods
Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance
Stochastic approximation, cooperative dynamics and supermodular games
Recursive estimation for ordered eigenvectors of symmetric matrix with observation noise
Unnamed Item
Nonlinear randomized urn models: a stochastic approximation viewpoint
On the convergence of reinforcement learning
Avoidance of traps in stochastic approximation
The actor-critic algorithm as multi-time-scale stochastic approximation.
Stochastic approximation algorithms: overview and recent trends.
Convergence of a stochastic approximation version of the EM algorithm
Sharp convergence rates of stochastic approximation for degenerate roots
Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
An Adjusted Payoff-Based Procedure for Normal Form Games
Strongly vertex-reinforced jump process on a complete graph
Importance sampling for families of distributions
Stochastic approximation algorithms with constant step size whose average is cooperative
Weak convergence rates for stochastic approximation with application to multiple targets and simulated annealing
Piecewise constant triangular cooling schedules for generalized simulated annealing algorithms
Weak dependence beyond mixing and asymptotics for nonparametric regression



