Shalabh Bhatnagar

From MaRDI portal


List of research outcomes

This list is not complete; at the moment it represents only items from zbMATH Open and arXiv. We are working on adding further sources, so please check back here soon!

| Publication | Venue | Date of Publication | Type |
|---|---|---|---|
| Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization | Automatica | 2024-04-24 | Paper |
| A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games | IEEE Transactions on Automatic Control | 2023-09-21 | Paper |
| An Incremental Fast Policy Search Using a Single Sample Path | Lecture Notes in Computer Science | 2022-11-04 | Paper |
| Generalized Second-Order Value Iteration in Markov Decision Processes | IEEE Transactions on Automatic Control | 2022-10-11 | Paper |
| Analyzing approximate value iteration algorithms | Mathematics of Operations Research | 2022-09-26 | Paper |
| Stochastic recursive inclusions with non-additive iterate-dependent Markov noise | Stochastics | 2022-06-30 | Paper |
| Stochastic Approximation With Iterate-Dependent Markov Noise Under Verifiable Conditions in Compact State Space With the Stability of Iterates Not Ensured | IEEE Transactions on Automatic Control | 2022-02-24 | Paper |
| On tight bounds for function approximation error in risk-sensitive reinforcement learning | Systems & Control Letters | 2021-11-10 | Paper |
| Asynchronous Stochastic Approximations With Asymptotically Biased Errors and Deep Multiagent Learning | IEEE Transactions on Automatic Control | 2021-09-09 | Paper |
| Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise | Mathematics of Operations Research | 2021-01-08 | Paper |
| Gradient-based adaptive stochastic search for simulation optimization over continuous space | INFORMS Journal on Computing | 2020-11-09 | Paper |
| Random Directions Stochastic Approximation With Deterministic Perturbations | IEEE Transactions on Automatic Control | 2020-10-07 | Paper |
| Analysis of Stochastic Approximation Schemes With Set-Valued Maps in the Absence of a Stability Guarantee and Their Stabilization | IEEE Transactions on Automatic Control | 2020-10-07 | Paper |
| Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning | Mathematics of Operations Research | 2020-03-11 | Paper |
| Stability of Stochastic Approximations With “Controlled Markov” Noise and Temporal Difference Learning | IEEE Transactions on Automatic Control | 2019-07-18 | Paper |
| An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method | Machine Learning | 2018-12-07 | Paper |
| An incremental off-policy search in a model-free Markov decision process using a single sample path | Machine Learning | 2018-11-12 | Paper |
| scientific article; zbMATH DE number 6936843 | | 2018-09-14 | Paper |
| Random directions stochastic approximation with deterministic perturbations | | 2018-08-08 | Paper |
| Revisiting the cross entropy method with applications in stochastic global optimization and reinforcement learning | | 2018-07-12 | Paper |
| A Linearly Relaxed Approximate Linear Program for Markov Decision Processes | IEEE Transactions on Automatic Control | 2018-06-27 | Paper |
| Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences | ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
| Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization | ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
| Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization | ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
| Optimal parameter trajectory estimation in parameterized SDEs: an algorithmic procedure | ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
| Stochastic approximation algorithms for constrained optimization via simulation | ACM Transactions on Modeling and Computer Simulation | 2018-04-16 | Paper |
| A stability criterion for two timescale stochastic approximation schemes | Automatica | 2017-10-11 | Paper |
| A generalization of the Borkar-Meyn theorem for stochastic recursive inclusions | Mathematics of Operations Research | 2017-09-22 | Paper |
| Adaptive System Optimization Using Random Directions Stochastic Approximation | IEEE Transactions on Automatic Control | 2017-07-27 | Paper |
| A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes | IEEE Transactions on Automatic Control | 2017-07-12 | Paper |
| Smoothed functional algorithms for stochastic optimization using \(q\)-Gaussian distributions | ACM Transactions on Modeling and Computer Simulation | 2017-06-30 | Paper |
| Actor-critic algorithms with online feature adaptation | ACM Transactions on Modeling and Computer Simulation | 2017-06-30 | Paper |
| Multi-armed bandits based on a variant of simulated annealing | Indian Journal of Pure & Applied Mathematics | 2016-12-13 | Paper |
| Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem | Stochastics | 2016-11-25 | Paper |
| Multiscale Q-learning with linear function approximation | Discrete Event Dynamic Systems | 2016-09-16 | Paper |
| Dynamics of stochastic approximation with iterate-dependent Markov noise under verifiable conditions in compact state space with the stability of iterates not ensured | | 2016-01-10 | Paper |
| Necessary and sufficient conditions for optimality in constrained general sum stochastic games | Systems & Control Letters | 2015-11-02 | Paper |
| Simultaneous perturbation Newton algorithms for simulation optimization | Journal of Optimization Theory and Applications | 2015-03-11 | Paper |
| A simulation-based algorithm for optimal pricing policy under demand uncertainty | International Transactions in Operational Research | 2015-02-25 | Paper |
| Newton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithms | Automatica | 2014-11-19 | Paper |
| New algorithms of the Q-learning type | Automatica | 2014-03-19 | Paper |
| General-sum stochastic games: verifiability conditions for Nash equilibria | Automatica | 2012-12-13 | Paper |
| Stochastic recursive algorithms for optimization. Simultaneous perturbation methods | Lecture Notes in Control and Information Sciences | 2012-08-20 | Paper |
| An online actor-critic algorithm with function approximation for constrained Markov decision processes | Journal of Optimization Theory and Applications | 2012-07-31 | Paper |
| scientific article; zbMATH DE number 5957388 | | 2011-10-12 | Paper |
| Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems | Applied Mathematical Modelling | 2011-08-28 | Paper |
| The Borkar-Meyn theorem for asynchronous stochastic approximations | Systems & Control Letters | 2011-07-27 | Paper |
| An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes | Systems & Control Letters | 2011-01-12 | Paper |
| Natural actor-critic algorithms | Automatica | 2010-01-08 | Paper |
| Ant Colony Optimization Algorithms for Shortest Path Problems | Lecture Notes in Computer Science | 2009-03-26 | Paper |
| An extension of Wick's theorem | Statistics & Probability Letters | 2008-10-30 | Paper |
| Gelfand-Yaglom-Perez theorem for generalized relative entropy functionals | Information Sciences | 2008-01-03 | Paper |
| Reinforcement learning based algorithms for average cost Markov decision processes | Discrete Event Dynamic Systems | 2007-08-27 | Paper |
| Actor-critic algorithms for hierarchical Markov decision processes | Automatica | 2006-12-07 | Paper |
| Multiscale Stochastic Approximation for Parametric Optimization of Hidden Markov Models | Probability in the Engineering and Informational Sciences | 2006-09-22 | Paper |
| Nongeneralizability of Tsallis Entropy by means of Kolmogorov-Nagumo averages under pseudo-additivity | | 2005-05-30 | Paper |
| Nonextensive triangle equality and other properties of Tsallis relative-entropy minimization | | 2005-01-11 | Paper |
| A time aggregation approach to Markov decision processes | Automatica | 2002-09-05 | Paper |
| An optimal fuel-injection policy for performance enhancement in internal combustion engines. | Sādhanā | 2002-02-18 | Paper |
| Two timescale SPSA algorithms for rate-based ABR flow control | | 2001-12-17 | Paper |
| A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization | Probability in the Engineering and Informational Sciences | 2000-12-12 | Paper |
| A Convex Analytic Framework for Ergodic Control of Semi-Markov Processes | Mathematics of Operations Research | 1996-07-15 | Paper |




This page was built for person: Shalabh Bhatnagar