Nonconvergence to unstable points in urn models and stochastic approximations

Publication:921715

DOI: 10.1214/aop/1176990853; zbMath: 0709.60054; OpenAlex: W2015836240; MaRDI QID: Q921715

Robin Pemantle

Publication date: 1990

Published in: The Annals of Probability

Full work available at URL: https://doi.org/10.1214/aop/1176990853




Related Items (68)

When can the two-armed bandit algorithm be trusted?
Generalized urn models of evolutionary processes.
Vertex reinforced random walks with exponential interaction on complete graphs
Multiscale Q-learning with linear function approximation
Vertex-reinforced random walks and a conjecture of Pemantle
Asymptotic pseudotrajectories and chain recurrent flows, with applications
Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation
Escaping Strict Saddle Points of the Moreau Envelope in Nonsmooth Optimization
Two repelling random walks on \(\mathbb{Z}\)
Phase transitions in non-linear urns with interacting types
Global convergence of the gradient method for functions definable in o-minimal structures
Risk-Sensitive Reinforcement Learning via Policy Gradient Search
Convergence of the Momentum Method for Semialgebraic Functions with Locally Lipschitz Gradients
How Fast Is the Bandit?
Heterogeneous beliefs and local information in stochastic fictitious play
First-order methods almost always avoid strict saddle points
Sufficient Conditions for Instability of the Subgradient Method with Constant Step Size
Learning in monotone Bayesian games
Interacting nonlinear reinforced stochastic processes: Synchronization or non-synchronization
Stochastic heavy ball
Dynamics of Morse-Smale urn processes
Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement
Coexistence in Preferential Attachment Networks
A time-dependent version of Pólya's urn
Stochastic learning in multi-agent optimization: communication and payoff-based approaches
A Newton-Based Method for Nonconvex Optimization with Fast Evasion of Saddle Points
Second-Order Guarantees of Distributed Gradient Algorithms
Convergent multiple-timescales reinforcement learning algorithms in normal form games
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Attracting edge and strongly edge reinforced walks
Vertex-reinforced random walk
Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance
Negatively reinforced balanced urn schemes
Nonconvergence to saddle boundary points under perturbed reinforcement learning
On a notion of partially conditionally identically distributed sequences
Quasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimization
Stochastic approximation, cooperative dynamics and supermodular games
NEWTONIAN MECHANICS AND NASH PLAY
Learning, information, and sorting in market entry games: theory and evidence
A behavioral learning process in games
Unnamed Item
Nonlinear randomized urn models: a stochastic approximation viewpoint
On the convergence of reinforcement learning
Avoidance of traps in stochastic approximation
The actor-critic algorithm as multi-time-scale stochastic approximation.
Stochastic approximation algorithms: overview and recent trends.
Urn models and differential algebraic equations
Learning to signal: Analysis of a micro-level reinforcement model
Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
A Herding Perspective on Global Games and Multiplicity
Learning across games
Adaptive dynamics in games played by heterogeneous populations
An Adjusted Payoff-Based Procedure for Normal Form Games
Mutation, Sexual Reproduction and Survival in Dynamic Environments
Gradient Descent Only Converges to Minimizers: Non-Isolated Critical Points and Invariant Regions
An ODE method to prove the geometric convergence of adaptive stochastic algorithms
DRAWING MULTISETS OF BALLS FROM TENABLE BALANCED LINEAR URNS
On generalized Pólya urn models
On the fast convergence of random perturbations of the gradient flow
Natural actor-critic algorithms
Self-interacting diffusions. III: Symmetric interactions
Time to absorption in discounted reinforcement models.
Attainability of boundary points under reinforcement learning
One-dimensional system arising in stochastic gradient descent
Forbidden Transactions and Black Markets
On (un)knots and dynamics in games
Extending the Step-Size Restriction for Gradient Descent to Avoid Strict Saddle Points
On Gradient-Based Learning in Continuous Games



