Pages that link to "Item:Q1013655"
From MaRDI portal
The following pages link to Stochastic approximation. A dynamical systems viewpoint. (Q1013655):
Displaying 42 items.
- Continuous monitoring for changepoints in data streams using adaptive estimation (Q159921) (← links)
- Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation (Q367468) (← links)
- Local voting protocol for decentralized load balancing of network with switched topology and noise in measurements (Q460569) (← links)
- Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions (Q507334) (← links)
- Quasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimization (Q523576) (← links)
- Distributed resource allocation over random networks based on stochastic approximation (Q1643399) (← links)
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603) (← links)
- Stochastic heavy ball (Q1697485) (← links)
- Asymptotic bias of stochastic gradient search (Q1704136) (← links)
- Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling (Q2051259) (← links)
- Revisiting the ODE method for recursive algorithms: fast convergence using quasi stochastic approximation (Q2070010) (← links)
- An ODE method to prove the geometric convergence of adaptive stochastic algorithms (Q2074991) (← links)
- Fundamental design principles for reinforcement learning algorithms (Q2094028) (← links)
- Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040) (← links)
- Stochastic approximation method using diagonal positive-definite matrices for convex optimization with fixed point constraints (Q2138441) (← links)
- Stochastic approximation with random step sizes and urn models with random replacement matrices having finite mean (Q2330454) (← links)
- Approximate consensus in the dynamic stochastic network with incomplete information and measurement delays (Q2393044) (← links)
- Multi-armed bandits based on a variant of simulated annealing (Q2520136) (← links)
- Event-driven stochastic approximation (Q2520142) (← links)
- Langevin type limiting processes for adaptive MCMC (Q2520143) (← links)
- Central limit theorems for stochastic approximation with controlled Markov chain dynamics (Q2786468) (← links)
- Simultaneous Perturbation Stochastic Approximation with Norm-Limited Update Vector (Q2813999) (← links)
- Approximate policy iteration: a survey and some new methods (Q2887629) (← links)
- Opportunistic Transmission over Randomly Varying Channels (Q3616977) (← links)
- Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation (Q5009779) (← links)
- A Stochastic Approximation Method for Simulation-Based Quantile Optimization (Q5060775) (← links)
- Risk-Sensitive Reinforcement Learning via Policy Gradient Search (Q5102286) (← links)
- A Concentration Bound for Stochastic Approximation via Alekseev’s Formula (Q5113889) (← links)
- Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies (Q5139670) (← links)
- Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis (Q5162625) (← links)
- Improving control performance across AWGN channels using a relay node<sup>†</sup> (Q5168008) (← links)
- Asynchronous stochastic approximation with differential inclusions (Q5168859) (← links)
- (Q5168862) (← links)
- Robust scheduling for flexible processing networks (Q5233182) (← links)
- Actor-Critic Algorithms with Online Feature Adaptation (Q5270681) (← links)
- Smoothed Functional Algorithms for Stochastic Optimization Using <i>q</i> -Gaussian Distributions (Q5270716) (← links)
- Multi-agent consensus under a communication–broadcast mixed environment (Q5494522) (← links)
- Distributed Stochastic Optimization with Large Delays (Q5868949) (← links)
- Analyzing Approximate Value Iteration Algorithms (Q5868951) (← links)
- Sparse optimal control problems with intermediate constraints: Necessary conditions (Q6053697) (← links)
- Diffusion of binary opinions in a growing population with heterogeneous behaviour and external influence (Q6145315) (← links)
- Independent learning in stochastic games (Q6200215) (← links)