scientific article; zbMATH DE number 7370638

Publication date: 9 July 2021

Full work available at URL: https://arxiv.org/abs/2006.11674

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

weak convergence logistic regression variance reduction Langevin dynamics stochastic sampling stochastic gradient algorithm inverse reinforcement learning constrained Markov decision process passive learning Bernstein von-Mises theorem inverse Bayesian learning Markov chain hyper-parameter martingale averaging theory

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Cites Work

Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Continuous-time Markov chains and applications. A two-time-scale approach
Accelerating diffusions
How does a stochastic optimization/approximation algorithm adapt to a randomly evolving optimum/root with jump Markov sample paths
The strong ergodic theorem for densities: Generalized Shannon-McMillan- Breiman theorem
Accelerating Gaussian diffusions
Langevin-type models. I: Diffusions with given stationary distributions and their discretizations
Inference in hidden Markov models.
Passive stochastic approximation
Partially Observed Markov Decision Processes
Asynchronous Stochastic Approximation Algorithms for Networked Systems: Regime-Switching Topologies and Multiscale Structure
Online Markov Decision Processes With Kullback–Leibler Control Cost
Nonparametric sequential estimation of zeros and extrema of regression functions
Recursive Stochastic Algorithms for Global Optimization in $\mathbb{R}^d $
Analysis of recursive stochastic algorithms
How to apply the method of stochastic approximation in the non-parametric estimation of a regression function¹
Introduction to Stochastic Search and Optimization
${Q}$-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control
Monotonicity of Constrained Optimal Transmission Policies in Correlated Fading Channels With ARQ
Regime Switching Stochastic Approximation Algorithms with Application to Adaptive Discrete Stochastic Optimization
Passive stochastic approximation with constant step size and window width
Distributed Subgradient Methods for Multi-Agent Optimization
Stochastic Processes and Applications
The Construction of Utility Functions from Expenditure Data