The following pages link to (Q4346705):
Displayed 50 items.
- Accelerated randomized stochastic optimization. (Q1434014) (← links)
- Vertex-reinforced random walk on \(\mathbb Z\) has finite range (Q1568291) (← links)
- A class of learning/estimation algorithms using nominal values: Asymptotic analysis and applications (Q1579641) (← links)
- Deterministic convergence of an online gradient method for neural networks (Q1612376) (← links)
- Privacy preserving distributed optimization using homomorphic encryption (Q1716476) (← links)
- Synchronization and functional central limit theorems for interacting reinforced random walks (Q1756956) (← links)
- Estimation of an optimal solution of a LP problem with unknown objective function (Q1764240) (← links)
- Adaptive designs and Robbins-Monro algorithm (Q1776861) (← links)
- On the convergence of reinforcement learning (Q1779805) (← links)
- Real-time reinforcement learning by sequential actor-critics and experience replay (Q1784532) (← links)
- Convergence of a stochastic approximation version of the EM algorithm (Q1807177) (← links)
- Recursive estimation of a drifted autoregressive parameter. (Q1848802) (← links)
- Workshop on statistical approaches for the evaluation of complex computer models (Q1872604) (← links)
- Generalized neural networks for spectral analysis: dynamics and Liapunov functions (Q1877570) (← links)
- Convergence rate of linear two-time-scale stochastic approximation. (Q1879892) (← links)
- Analysis of a high-resolution optical wave-front control system (Q1883119) (← links)
- Perturbation analysis for production control and optimization of manufacturing systems (Q1888414) (← links)
- New stochastic approximation algorithms with adaptive step sizes (Q1926628) (← links)
- Stochastic Nelder-Mead simplex method -- a new globally convergent direct search method for simulation optimization (Q1926785) (← links)
- Error bounds for constant step-size \(Q\)-learning (Q1932736) (← links)
- Approximate stochastic annealing for online control of infinite horizon Markov decision processes (Q1937498) (← links)
- Long range search for maximum likelihood in exponential families (Q1950807) (← links)
- Online expectation maximization based algorithms for inference in hidden Markov models (Q1951134) (← links)
- Recursive estimation procedures for one-dimensional parameter of statistical models associated with semimartingales (Q2010658) (← links)
- Convergence of stochastic proximal gradient algorithm (Q2019902) (← links)
- Structural stability threshold for the condition of robust no deterministic sure arbitrage with unbounded profit (Q2038508) (← links)
- Fundamental design principles for reinforcement learning algorithms (Q2094028) (← links)
- Large deviations and stochastic stability in population games (Q2106071) (← links)
- Reference points and learning (Q2138367) (← links)
- ASD+M: automatic parameter tuning in stochastic optimization and on-line learning (Q2179079) (← links)
- \(\alpha\)-variational inference with statistical guarantees (Q2196198) (← links)
- Semimartingale stochastic approximation procedure and recursive estimation (Q2255959) (← links)
- Simultaneous perturbation Newton algorithms for simulation optimization (Q2260692) (← links)
- A latent discrete Markov random field approach to identifying and classifying historical forest communities based on spatial multivariate tree species counts (Q2291519) (← links)
- On the stability of an adaptive learning dynamics in traffic games (Q2319665) (← links)
- Non-asymptotic error bounds for constant stepsize stochastic approximation for tracking mobile agents (Q2334631) (← links)
- Convergence of the Robbins-Monro process with small steps from below (Q2345959) (← links)
- Equilibrium routing under uncertainty (Q2349119) (← links)
- A survey of randomized algorithms for control synthesis and performance verification (Q2371305) (← links)
- An information-theoretic analysis of return maximization in reinforcement learning (Q2375396) (← links)
- Quantile estimation with adaptive importance sampling (Q2380103) (← links)
- Online calibrated forecasts: memory efficiency versus universality for learning in games (Q2384142) (← links)
- Stochastic approximation with series of delayed observations (Q2402730) (← links)
- Scaling up Bayesian variational inference using distributed computing clusters (Q2411280) (← links)
- A sequential design for a clinical trial with a linear prognostic factor (Q2455418) (← links)
- Transient and asymptotic dynamics of reinforcement learning in games (Q2462286) (← links)
- Dynamic modeling and control of supply chain systems: A review (Q2483501) (← links)
- Equilibrium selection in games: the mollifier method (Q2486410) (← links)
- Bayesian and non-Bayesian analysis of gamma stochastic frontier models by Markov chain Monte Carlo methods (Q2488426) (← links)
- The asymptotic equipartition property in reinforcement learning and its relation to return maximization (Q2488678) (← links)