L. A. Prashanth

List of research outcomes

This list is incomplete: at the moment it includes only items from zbMATH Open and arXiv. We are working on additional sources. Please check back here soon!

Publication	Venue	Date of Publication	Type
Nonasymptotic Bounds for Stochastic Optimization With Biased Noisy Gradient Oracles	IEEE Transactions on Automatic Control	2023-09-28	Paper
Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling	Machine Learning	2021-11-24	Paper
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpoint	Systems & Control Letters	2021-11-10	Paper
Concentration bounds for empirical conditional value-at-risk: the unbounded case	Operations Research Letters	2020-02-10	Paper
Stochastic Optimization in a Cumulative Prospect Theory Framework	IEEE Transactions on Automatic Control	2018-09-18	Paper
Random directions stochastic approximation with deterministic perturbations	arXiv preprint	2018-08-08	Paper
Variance-constrained actor-critic algorithms for discounted and average reward MDPs	Machine Learning	2018-01-12	Paper
Adaptive System Optimization Using Random Directions Stochastic Approximation	IEEE Transactions on Automatic Control	2017-07-27	Paper
A constrained optimization perspective on actor-critic algorithms and application to network routing	Systems & Control Letters	2016-05-20	Paper
Simultaneous perturbation Newton algorithms for simulation optimization	Journal of Optimization Theory and Applications	2015-03-11	Paper
Policy gradients for CVaR-constrained MDPs	Lecture Notes in Computer Science	2015-01-14	Paper
Stochastic recursive algorithms for optimization. Simultaneous perturbation methods	Lecture Notes in Control and Information Sciences	2012-08-20	Paper

This page was built for person: L. A. Prashanth