| Publication | Date of Publication | Type |
|---|
Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization Automatica | 2024-04-24 | Paper |
A Generalized Minimax Q-Learning Algorithm for Two-Player Zero-Sum Stochastic Games IEEE Transactions on Automatic Control | 2023-09-21 | Paper |
An Incremental Fast Policy Search Using a Single Sample Path Lecture Notes in Computer Science | 2022-11-04 | Paper |
Generalized Second-Order Value Iteration in Markov Decision Processes IEEE Transactions on Automatic Control | 2022-10-11 | Paper |
Analyzing approximate value iteration algorithms Mathematics of Operations Research | 2022-09-26 | Paper |
Stochastic recursive inclusions with non-additive iterate-dependent Markov noise Stochastics | 2022-06-30 | Paper |
Stochastic Approximation With Iterate-Dependent Markov Noise Under Verifiable Conditions in Compact State Space With the Stability of Iterates Not Ensured IEEE Transactions on Automatic Control | 2022-02-24 | Paper |
On tight bounds for function approximation error in risk-sensitive reinforcement learning Systems & Control Letters | 2021-11-10 | Paper |
Asynchronous Stochastic Approximations With Asymptotically Biased Errors and Deep Multiagent Learning IEEE Transactions on Automatic Control | 2021-09-09 | Paper |
Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise Mathematics of Operations Research | 2021-01-08 | Paper |
Gradient-based adaptive stochastic search for simulation optimization over continuous space INFORMS Journal on Computing | 2020-11-09 | Paper |
Random Directions Stochastic Approximation With Deterministic Perturbations IEEE Transactions on Automatic Control | 2020-10-07 | Paper |
Analysis of Stochastic Approximation Schemes With Set-Valued Maps in the Absence of a Stability Guarantee and Their Stabilization IEEE Transactions on Automatic Control | 2020-10-07 | Paper |
Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning Mathematics of Operations Research | 2020-03-11 | Paper |
Stability of Stochastic Approximations With “Controlled Markov” Noise and Temporal Difference Learning IEEE Transactions on Automatic Control | 2019-07-18 | Paper |
An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method Machine Learning | 2018-12-07 | Paper |
An incremental off-policy search in a model-free Markov decision process using a single sample path Machine Learning | 2018-11-12 | Paper |
scientific article; zbMATH DE number 6936843 (Why is no real title available?) | 2018-09-14 | Paper |
Random directions stochastic approximation with deterministic perturbations | 2018-08-08 | Paper |
Revisiting the cross entropy method with applications in stochastic global optimization and reinforcement learning | 2018-07-12 | Paper |
A Linearly Relaxed Approximate Linear Program for Markov Decision Processes IEEE Transactions on Automatic Control | 2018-06-27 | Paper |
Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
Optimal parameter trajectory estimation in parameterized SDEs: an algorithmic procedure ACM Transactions on Modeling and Computer Simulation | 2018-06-12 | Paper |
Stochastic approximation algorithms for constrained optimization via simulation ACM Transactions on Modeling and Computer Simulation | 2018-04-16 | Paper |
A stability criterion for two timescale stochastic approximation schemes Automatica | 2017-10-11 | Paper |
A generalization of the Borkar-Meyn theorem for stochastic recursive inclusions Mathematics of Operations Research | 2017-09-22 | Paper |
Adaptive System Optimization Using Random Directions Stochastic Approximation IEEE Transactions on Automatic Control | 2017-07-27 | Paper |
A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes IEEE Transactions on Automatic Control | 2017-07-12 | Paper |
Smoothed functional algorithms for stochastic optimization using \(q\)-Gaussian distributions ACM Transactions on Modeling and Computer Simulation | 2017-06-30 | Paper |
Actor-critic algorithms with online feature adaptation ACM Transactions on Modeling and Computer Simulation | 2017-06-30 | Paper |
Multi-armed bandits based on a variant of simulated annealing Indian Journal of Pure & Applied Mathematics | 2016-12-13 | Paper |
Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem Stochastics | 2016-11-25 | Paper |
Multiscale Q-learning with linear function approximation Discrete Event Dynamic Systems | 2016-09-16 | Paper |
Dynamics of stochastic approximation with iterate-dependent Markov noise under verifiable conditions in compact state space with the stability of iterates not ensured | 2016-01-10 | Paper |
Necessary and sufficient conditions for optimality in constrained general sum stochastic games Systems & Control Letters | 2015-11-02 | Paper |
Simultaneous perturbation Newton algorithms for simulation optimization Journal of Optimization Theory and Applications | 2015-03-11 | Paper |
A simulation-based algorithm for optimal pricing policy under demand uncertainty International Transactions in Operational Research | 2015-02-25 | Paper |
Newton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithms Automatica | 2014-11-19 | Paper |
New algorithms of the Q-learning type Automatica | 2014-03-19 | Paper |
General-sum stochastic games: verifiability conditions for Nash equilibria Automatica | 2012-12-13 | Paper |
Stochastic recursive algorithms for optimization. Simultaneous perturbation methods Lecture Notes in Control and Information Sciences | 2012-08-20 | Paper |
An online actor-critic algorithm with function approximation for constrained Markov decision processes Journal of Optimization Theory and Applications | 2012-07-31 | Paper |
scientific article; zbMATH DE number 5957388 (Why is no real title available?) | 2011-10-12 | Paper |
Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems Applied Mathematical Modelling | 2011-08-28 | Paper |
The Borkar-Meyn theorem for asynchronous stochastic approximations Systems & Control Letters | 2011-07-27 | Paper |
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes Systems & Control Letters | 2011-01-12 | Paper |
Natural actor-critic algorithms Automatica | 2010-01-08 | Paper |
Ant Colony Optimization Algorithms for Shortest Path Problems Lecture Notes in Computer Science | 2009-03-26 | Paper |
An extension of Wick's theorem Statistics & Probability Letters | 2008-10-30 | Paper |
Gelfand-Yaglom-Perez theorem for generalized relative entropy functionals Information Sciences | 2008-01-03 | Paper |
Reinforcement learning based algorithms for average cost Markov decision processes Discrete Event Dynamic Systems | 2007-08-27 | Paper |
Actor-critic algorithms for hierarchical Markov decision processes Automatica | 2006-12-07 | Paper |
Multiscale Stochastic Approximation for Parametric Optimization of Hidden Markov Models Probability in the Engineering and Informational Sciences | 2006-09-22 | Paper |
Nongeneralizability of Tsallis Entropy by means of Kolmogorov-Nagumo averages under pseudo-additivity | 2005-05-30 | Paper |
Nonextensive triangle equality and other properties of Tsallis relative-entropy minimization | 2005-01-11 | Paper |
A time aggregation approach to Markov decision processes Automatica | 2002-09-05 | Paper |
An optimal fuel-injection policy for performance enhancement in internal combustion engines. Sādhanā | 2002-02-18 | Paper |
Two timescale SPSA algorithms for rate-based ABR flow control | 2001-12-17 | Paper |
A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization Probability in the Engineering and Informational Sciences | 2000-12-12 | Paper |
A Convex Analytic Framework for Ergodic Control of Semi-Markov Processes Mathematics of Operations Research | 1996-07-15 | Paper |