scientific article; zbMATH DE number 7370638
From MaRDI portal
Publication:4999096
George Yin, Vikram Krishnamurthy
Publication date: 9 July 2021
Full work available at URL: https://arxiv.org/abs/2006.11674
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
weak convergencelogistic regressionvariance reductionLangevin dynamicsstochastic samplingstochastic gradient algorithminverse reinforcement learningconstrained Markov decision processpassive learningBernstein von-Mises theoreminverse Bayesian learningMarkov chain hyper-parametermartingale averaging theory
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Continuous-time Markov chains and applications. A two-time-scale approach
- Accelerating diffusions
- How does a stochastic optimization/approximation algorithm adapt to a randomly evolving optimum/root with jump Markov sample paths
- The strong ergodic theorem for densities: Generalized Shannon-McMillan- Breiman theorem
- Accelerating Gaussian diffusions
- Langevin-type models. I: Diffusions with given stationary distributions and their discretizations
- Inference in hidden Markov models.
- Passive stochastic approximation
- Partially Observed Markov Decision Processes
- Asynchronous Stochastic Approximation Algorithms for Networked Systems: Regime-Switching Topologies and Multiscale Structure
- Online Markov Decision Processes With Kullback–Leibler Control Cost
- Nonparametric sequential estimation of zeros and extrema of regression functions
- Recursive Stochastic Algorithms for Global Optimization in $\mathbb{R}^d $
- Analysis of recursive stochastic algorithms
- How to apply the method of stochastic approximation in the non-parametric estimation of a regression function1
- Introduction to Stochastic Search and Optimization
- ${Q}$-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control
- Monotonicity of Constrained Optimal Transmission Policies in Correlated Fading Channels With ARQ
- Regime Switching Stochastic Approximation Algorithms with Application to Adaptive Discrete Stochastic Optimization
- Passive stochastic approximation with constant step size and window width
- Distributed Subgradient Methods for Multi-Agent Optimization
- Stochastic Processes and Applications
- The Construction of Utility Functions from Expenditure Data