Pages that link to "Item:Q230108"

From MaRDI portal

← Shalabh Bhatnagar (Q230108)

Jump to:navigation, search

What links here

⧼whatlinkshere-whatlinkshere-target⧽

Page:

⧼whatlinkshere-whatlinkshere-ns⧽

Namespace:

Invert selection

⧼whatlinkshere-whatlinkshere-filter⧽

Hide transclusions

Hide links

Hide redirects

The following pages link to Shalabh Bhatnagar (Q230108):

Displaying 50 items.

(Q441137) (redirect page) (← links)
Stochastic recursive algorithms for optimization. Simultaneous perturbation methods (Q441138) (← links)
Newton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithms (Q472587) (← links)
The Borkar-Meyn theorem for asynchronous stochastic approximations (Q553371) (← links)
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes (Q616967) (← links)
Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems (Q636548) (← links)
General-sum stochastic games: verifiability conditions for Nash equilibria (Q694841) (← links)
Actor-critic algorithms for hierarchical Markov decision processes (Q856510) (← links)
Necessary and sufficient conditions for optimality in constrained general sum stochastic games (Q888801) (← links)
An extension of Wick's theorem (Q951186) (← links)
Natural actor-critic algorithms (Q1049136) (← links)
A time aggregation approach to Markov decision processes (Q1614322) (← links)
An incremental off-policy search in a model-free Markov decision process using a single sample path (Q1621868) (← links)
An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method (Q1631797) (← links)
On tight bounds for function approximation error in risk-sensitive reinforcement learning (Q2243003) (← links)
Simultaneous perturbation Newton algorithms for simulation optimization (Q2260692) (← links)
A stability criterion for two timescale stochastic approximation schemes (Q2409333) (← links)
New algorithms of the Q-learning type (Q2440701) (← links)
Gelfand-Yaglom-Perez theorem for generalized relative entropy functionals (Q2465342) (← links)
Multi-armed bandits based on a variant of simulated annealing (Q2520136) (← links)
Reinforcement learning based algorithms for average cost Markov decision processes (Q2643632) (← links)
(Q2724383) (← links)
Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem (Q2833720) (← links)
(Q3174029) (← links)
Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise (Q3387930) (← links)
Ant Colony Optimization Algorithms for Shortest Path Problems (Q3616975) (← links)
Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences (Q4564833) (← links)
Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization (Q4565351) (← links)
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization (Q4565394) (← links)
Optimal parameter trajectory estimation in parameterized SDEs (Q4565411) (← links)
A Linearly Relaxed Approximate Linear Program for Markov Decision Processes (Q4567182) (← links)
(Q4576234) (← links)
Stochastic approximation algorithms for constrained optimization via simulation (Q4635177) (← links)
A Convex Analytic Framework for Ergodic Control of Semi-Markov Processes (Q4881517) (← links)
A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization (Q4950732) (← links)
Asynchronous Stochastic Approximations With Asymptotically Biased Errors and Deep Multiagent Learning (Q4957690) (← links)
Stochastic Approximation With Iterate-Dependent Markov Noise Under Verifiable Conditions in Compact State Space With the Stability of Iterates Not Ensured (Q5033892) (← links)
Generalized Second-Order Value Iteration in Markov Decision Processes (Q5040103) (← links)
An Incremental Fast Policy Search Using a Single Sample Path (Q5045345) (← links)
Stochastic recursive inclusions with non-additive iterate-dependent Markov noise (Q5085839) (← links)
Analysis of Stochastic Approximation Schemes With Set-Valued Maps in the Absence of a Stability Guarantee and Their Stabilization (Q5125524) (← links)
Random Directions Stochastic Approximation With Deterministic Perturbations (Q5125671) (← links)
Gradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous Space (Q5131717) (← links)
A simulation‐based algorithm for optimal pricing policy under demand uncertainty (Q5175839) (← links)
Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning (Q5219302) (← links)
Stability of Stochastic Approximations With “Controlled Markov” Noise and Temporal Difference Learning (Q5223776) (← links)
Actor-Critic Algorithms with Online Feature Adaptation (Q5270681) (← links)
Smoothed Functional Algorithms for Stochastic Optimization Using <i>q</i> -Gaussian Distributions (Q5270716) (← links)
A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes (Q5273731) (← links)
Adaptive System Optimization Using Random Directions Stochastic Approximation (Q5280408) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q230108"