Pages that link to "Item:Q921715"
From MaRDI portal
The following pages link to Nonconvergence to unstable points in urn models and stochastic approximations (Q921715):
Displaying 50 items.
- Multiscale Q-learning with linear function approximation (Q312650) (← links)
- Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation (Q367468) (← links)
- Nonconvergence to saddle boundary points under perturbed reinforcement learning (Q495761) (← links)
- Quasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimization (Q523576) (← links)
- Heterogeneous beliefs and local information in stochastic fictitious play (Q625038) (← links)
- On a notion of partially conditionally identically distributed sequences (Q681988) (← links)
- Stochastic approximation, cooperative dynamics and supermodular games (Q691120) (← links)
- A behavioral learning process in games (Q700080) (← links)
- Learning across games (Q765219) (← links)
- Learning in monotone Bayesian games (Q900836) (← links)
- A time-dependent version of Pólya's urn (Q920465) (← links)
- Learning to signal: Analysis of a micro-level reinforcement model (Q1004397) (← links)
- Natural actor-critic algorithms (Q1049136) (← links)
- Vertex-reinforced random walk (Q1184048) (← links)
- Vertex-reinforced random walks and a conjecture of Pemantle (Q1356346) (← links)
- Convergent multiple-timescales reinforcement learning algorithms in normal form games (Q1429103) (← links)
- Adaptive dynamics in games played by heterogeneous populations (Q1566893) (← links)
- On generalized Pólya urn models (Q1579854) (← links)
- Stochastic heavy ball (Q1697485) (← links)
- Stochastic learning in multi-agent optimization: communication and payoff-based approaches (Q1716626) (← links)
- Negatively reinforced balanced urn schemes (Q1738316) (← links)
- On the convergence of reinforcement learning (Q1779805) (← links)
- Mixed equilibria and dynamical systems arising from fictitious play in perturbed games (Q1818284) (← links)
- On (un)knots and dynamics in games (Q1867023) (← links)
- When can the two-armed bandit algorithm be trusted? (Q1879915) (← links)
- Generalized urn models of evolutionary processes. (Q1879916) (← links)
- Asymptotic pseudotrajectories and chain recurrent flows, with applications (Q1915998) (← links)
- An ODE method to prove the geometric convergence of adaptive stochastic algorithms (Q2074991) (← links)
- Vertex reinforced random walks with exponential interaction on complete graphs (Q2132542) (← links)
- Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance (Q2233558) (← links)
- Nonlinear randomized urn models: a stochastic approximation viewpoint (Q2274219) (← links)
- First-order methods almost always avoid strict saddle points (Q2425175) (← links)
- Attracting edge and strongly edge reinforced walks (Q2456028) (← links)
- Learning, information, and sorting in market entry games: theory and evidence (Q2486151) (← links)
- Avoidance of traps in stochastic approximation (Q2503522) (← links)
- Self-interacting diffusions. III: Symmetric interactions (Q2571693) (← links)
- Time to absorption in discounted reinforcement models. (Q2574614) (← links)
- Attainability of boundary points under reinforcement learning (Q2577444) (← links)
- Phase transitions in non-linear urns with interacting types (Q2676930) (← links)
- Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement (Q2692526) (← links)
- An Adjusted Payoff-Based Procedure for Normal Form Games (Q2833112) (← links)
- DRAWING MULTISETS OF BALLS FROM TENABLE BALANCED LINEAR URNS (Q2845126) (← links)
- A Herding Perspective on Global Games and Multiplicity (Q3394927) (← links)
- How Fast Is the Bandit? (Q3506304) (← links)
- Urn models and differential algebraic equations (Q4435682) (← links)
- (Q4558484) (← links)
- A Newton-Based Method for Nonconvex Optimization with Fast Evasion of Saddle Points (Q4620423) (← links)
- Gradient Descent Only Converges to Minimizers: Non-Isolated Critical Points and Invariant Regions (Q4638050) (← links)
- Mutation, Sexual Reproduction and Survival in Dynamic Environments (Q4638065) (← links)
- NEWTONIAN MECHANICS AND NASH PLAY (Q4661975) (← links)