Pages that link to "Item:Q1013655"

From MaRDI portal

← Stochastic approximation. A dynamical systems viewpoint. (Q1013655)

Jump to:navigation, search

The following pages link to Stochastic approximation. A dynamical systems viewpoint. (Q1013655):

Displaying 42 items.

Continuous monitoring for changepoints in data streams using adaptive estimation (Q159921) (← links)
Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation (Q367468) (← links)
Local voting protocol for decentralized load balancing of network with switched topology and noise in measurements (Q460569) (← links)
Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions (Q507334) (← links)
Quasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimization (Q523576) (← links)
Distributed resource allocation over random networks based on stochastic approximation (Q1643399) (← links)
Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603) (← links)
Stochastic heavy ball (Q1697485) (← links)
Asymptotic bias of stochastic gradient search (Q1704136) (← links)
Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling (Q2051259) (← links)
Revisiting the ODE method for recursive algorithms: fast convergence using quasi stochastic approximation (Q2070010) (← links)
An ODE method to prove the geometric convergence of adaptive stochastic algorithms (Q2074991) (← links)
Fundamental design principles for reinforcement learning algorithms (Q2094028) (← links)
Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040) (← links)
Stochastic approximation method using diagonal positive-definite matrices for convex optimization with fixed point constraints (Q2138441) (← links)
Stochastic approximation with random step sizes and urn models with random replacement matrices having finite mean (Q2330454) (← links)
Approximate consensus in the dynamic stochastic network with incomplete information and measurement delays (Q2393044) (← links)
Multi-armed bandits based on a variant of simulated annealing (Q2520136) (← links)
Event-driven stochastic approximation (Q2520142) (← links)
Langevin type limiting processes for adaptive MCMC (Q2520143) (← links)
Central limit theorems for stochastic approximation with controlled Markov chain dynamics (Q2786468) (← links)
Simultaneous Perturbation Stochastic Approximation with Norm-Limited Update Vector (Q2813999) (← links)
Approximate policy iteration: a survey and some new methods (Q2887629) (← links)
Opportunistic Transmission over Randomly Varying Channels (Q3616977) (← links)
Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation (Q5009779) (← links)
A Stochastic Approximation Method for Simulation-Based Quantile Optimization (Q5060775) (← links)
Risk-Sensitive Reinforcement Learning via Policy Gradient Search (Q5102286) (← links)
A Concentration Bound for Stochastic Approximation via Alekseev’s Formula (Q5113889) (← links)
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies (Q5139670) (← links)
Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis (Q5162625) (← links)
Improving control performance across AWGN channels using a relay node<sup>†</sup> (Q5168008) (← links)
Asynchronous stochastic approximation with differential inclusions (Q5168859) (← links)
(Q5168862) (← links)
Robust scheduling for flexible processing networks (Q5233182) (← links)
Actor-Critic Algorithms with Online Feature Adaptation (Q5270681) (← links)
Smoothed Functional Algorithms for Stochastic Optimization Using <i>q</i> -Gaussian Distributions (Q5270716) (← links)
Multi-agent consensus under a communication–broadcast mixed environment (Q5494522) (← links)
Distributed Stochastic Optimization with Large Delays (Q5868949) (← links)
Analyzing Approximate Value Iteration Algorithms (Q5868951) (← links)
Sparse optimal control problems with intermediate constraints: Necessary conditions (Q6053697) (← links)
Diffusion of binary opinions in a growing population with heterogeneous behaviour and external influence (Q6145315) (← links)
Independent learning in stochastic games (Q6200215) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q1013655"