L. A. Prashanth

From MaRDI portal



List of research outcomes

This list is incomplete: at the moment it includes only items from zbMATH Open and arXiv. We are working on additional sources, so please check back soon.

| Publication | Venue | Date of Publication | Type |
|---|---|---|---|
| Nonasymptotic Bounds for Stochastic Optimization With Biased Noisy Gradient Oracles | IEEE Transactions on Automatic Control | 2023-09-28 | Paper |
| Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling | Machine Learning | 2021-11-24 | Paper |
| Smoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpoint | Systems & Control Letters | 2021-11-10 | Paper |
| Concentration bounds for empirical conditional value-at-risk: the unbounded case | Operations Research Letters | 2020-02-10 | Paper |
| Stochastic Optimization in a Cumulative Prospect Theory Framework | IEEE Transactions on Automatic Control | 2018-09-18 | Paper |
| Random directions stochastic approximation with deterministic perturbations | (available as arXiv preprint) | 2018-08-08 | Paper |
| Variance-constrained actor-critic algorithms for discounted and average reward MDPs | Machine Learning | 2018-01-12 | Paper |
| Adaptive System Optimization Using Random Directions Stochastic Approximation | IEEE Transactions on Automatic Control | 2017-07-27 | Paper |
| A constrained optimization perspective on actor-critic algorithms and application to network routing | Systems & Control Letters | 2016-05-20 | Paper |
| Simultaneous perturbation Newton algorithms for simulation optimization | Journal of Optimization Theory and Applications | 2015-03-11 | Paper |
| Policy gradients for CVaR-constrained MDPs | Lecture Notes in Computer Science | 2015-01-14 | Paper |
| Stochastic recursive algorithms for optimization. Simultaneous perturbation methods | Lecture Notes in Control and Information Sciences | 2012-08-20 | Paper |



