Shalabh Bhatnagar

List of research outcomes

This list is not complete; at the moment it covers only items from zbMATH Open and arXiv. We are working on additional sources, so please check back soon.

Publication	Date of Publication	Type
An actor-critic algorithm with function approximation for risk sensitive cost Markov decision processes IEEE Transactions on Automatic Control	2026-03-17	Paper
n-step temporal difference learning with optimal n Automatica	2025-08-05	Paper
Generalized simultaneous perturbation-based gradient search with reduced estimator bias IEEE Transactions on Automatic Control	2025-07-14	Paper
Approximate linear programming for decentralized policy iteration in cooperative multi-agent Markov decision processes Systems & Control Letters	2025-02-18	Paper
Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization Automatica	2024-04-24	Paper
A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games IEEE Transactions on Automatic Control	2023-09-21	Paper
An Incremental Fast Policy Search Using a Single Sample Path Lecture Notes in Computer Science	2022-11-04	Paper
Generalized Second-Order Value Iteration in Markov Decision Processes IEEE Transactions on Automatic Control	2022-10-11	Paper
Analyzing approximate value iteration algorithms Mathematics of Operations Research	2022-09-26	Paper
Stochastic recursive inclusions with non-additive iterate-dependent Markov noise Stochastics	2022-06-30	Paper
Stochastic Approximation With Iterate-Dependent Markov Noise Under Verifiable Conditions in Compact State Space With the Stability of Iterates Not Ensured IEEE Transactions on Automatic Control	2022-02-24	Paper
On tight bounds for function approximation error in risk-sensitive reinforcement learning Systems & Control Letters	2021-11-10	Paper
Asynchronous Stochastic Approximations With Asymptotically Biased Errors and Deep Multiagent Learning IEEE Transactions on Automatic Control	2021-09-09	Paper
Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise Mathematics of Operations Research	2021-01-08	Paper
Gradient-based adaptive stochastic search for simulation optimization over continuous space INFORMS Journal on Computing	2020-11-09	Paper
Random Directions Stochastic Approximation With Deterministic Perturbations IEEE Transactions on Automatic Control	2020-10-07	Paper
Analysis of Stochastic Approximation Schemes With Set-Valued Maps in the Absence of a Stability Guarantee and Their Stabilization IEEE Transactions on Automatic Control	2020-10-07	Paper
Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning Mathematics of Operations Research	2020-03-11	Paper
Stability of Stochastic Approximations With “Controlled Markov” Noise and Temporal Difference Learning IEEE Transactions on Automatic Control	2019-07-18	Paper
An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method Machine Learning	2018-12-07	Paper
An incremental off-policy search in a model-free Markov decision process using a single sample path Machine Learning	2018-11-12	Paper
scientific article; zbMATH DE number 6936843 (available as arXiv preprint)	2018-09-14	Paper
Random directions stochastic approximation with deterministic perturbations (available as arXiv preprint)	2018-08-08	Paper
Revisiting the cross entropy method with applications in stochastic global optimization and reinforcement learning	2018-07-12	Paper
A Linearly Relaxed Approximate Linear Program for Markov Decision Processes IEEE Transactions on Automatic Control	2018-06-27	Paper
Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences ACM Transactions on Modeling and Computer Simulation	2018-06-12	Paper
Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization ACM Transactions on Modeling and Computer Simulation	2018-06-12	Paper
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization ACM Transactions on Modeling and Computer Simulation	2018-06-12	Paper
Optimal parameter trajectory estimation in parameterized SDEs: an algorithmic procedure ACM Transactions on Modeling and Computer Simulation	2018-06-12	Paper
Stochastic approximation algorithms for constrained optimization via simulation ACM Transactions on Modeling and Computer Simulation	2018-04-16	Paper
A stability criterion for two timescale stochastic approximation schemes Automatica	2017-10-11	Paper
A generalization of the Borkar-Meyn theorem for stochastic recursive inclusions Mathematics of Operations Research	2017-09-22	Paper
Adaptive System Optimization Using Random Directions Stochastic Approximation IEEE Transactions on Automatic Control	2017-07-27	Paper
A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes IEEE Transactions on Automatic Control	2017-07-12	Paper
Smoothed functional algorithms for stochastic optimization using q-Gaussian distributions ACM Transactions on Modeling and Computer Simulation	2017-06-30	Paper
Actor-critic algorithms with online feature adaptation ACM Transactions on Modeling and Computer Simulation	2017-06-30	Paper
Multi-armed bandits based on a variant of simulated annealing Indian Journal of Pure & Applied Mathematics	2016-12-13	Paper
Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem Stochastics	2016-11-25	Paper
Multiscale Q-learning with linear function approximation Discrete Event Dynamic Systems	2016-09-16	Paper
Dynamics of stochastic approximation with iterate-dependent Markov noise under verifiable conditions in compact state space with the stability of iterates not ensured (available as arXiv preprint)	2016-01-10	Paper
Necessary and sufficient conditions for optimality in constrained general sum stochastic games Systems & Control Letters	2015-11-02	Paper
Simultaneous perturbation Newton algorithms for simulation optimization Journal of Optimization Theory and Applications	2015-03-11	Paper
A simulation-based algorithm for optimal pricing policy under demand uncertainty International Transactions in Operational Research	2015-02-25	Paper
Newton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithms Automatica	2014-11-19	Paper
New algorithms of the Q-learning type Automatica	2014-03-19	Paper
General-sum stochastic games: verifiability conditions for Nash equilibria Automatica	2012-12-13	Paper
Stochastic recursive algorithms for optimization. Simultaneous perturbation methods Lecture Notes in Control and Information Sciences	2012-08-20	Paper
An online actor-critic algorithm with function approximation for constrained Markov decision processes Journal of Optimization Theory and Applications	2012-07-31	Paper
scientific article; zbMATH DE number 5957388	2011-10-12	Paper
Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems Applied Mathematical Modelling	2011-08-28	Paper
The Borkar-Meyn theorem for asynchronous stochastic approximations Systems & Control Letters	2011-07-27	Paper
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes Systems & Control Letters	2011-01-12	Paper
Natural actor-critic algorithms Automatica	2010-01-08	Paper
Ant Colony Optimization Algorithms for Shortest Path Problems Lecture Notes in Computer Science	2009-03-26	Paper
An extension of Wick's theorem Statistics & Probability Letters	2008-10-30	Paper
Gelfand-Yaglom-Perez theorem for generalized relative entropy functionals Information Sciences	2008-01-03	Paper
Reinforcement learning based algorithms for average cost Markov decision processes Discrete Event Dynamic Systems	2007-08-27	Paper
Actor-critic algorithms for hierarchical Markov decision processes Automatica	2006-12-07	Paper
Multiscale Stochastic Approximation for Parametric Optimization of Hidden Markov Models Probability in the Engineering and Informational Sciences	2006-09-22	Paper
Nongeneralizability of Tsallis Entropy by means of Kolmogorov-Nagumo averages under pseudo-additivity	2005-05-30	Paper
Nonextensive triangle equality and other properties of Tsallis relative-entropy minimization	2005-01-11	Paper
A time aggregation approach to Markov decision processes Automatica	2002-09-05	Paper
An optimal fuel-injection policy for performance enhancement in internal combustion engines. Sādhanā	2002-02-18	Paper
Two timescale SPSA algorithms for rate-based ABR flow control	2001-12-17	Paper
A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization Probability in the Engineering and Informational Sciences	2000-12-12	Paper
A Convex Analytic Framework for Ergodic Control of Semi-Markov Processes Mathematics of Operations Research	1996-07-15	Paper
