A sensitivity formula for risk-sensitive cost and the actor-critic algorithm (Q5958425): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Set OpenAlex properties.
 
(4 intermediate revisions by 4 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multiplicative ergodicity and large deviations for an irreducible Markov chain. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3997575 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with two time scales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous Stochastic Approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning for Risk-Sensitive Control / rank
 
Normal rank
Property / cites work
 
Property / cites work: The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost / rank
 
Normal rank
Property / cites work
 
Property / cites work: Perturbation realization, potentials, and sensitivity analysis of Markov processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Connections between stochastic control and dynamic games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Control of Discrete-Time Markov Processes with Infinite Horizon / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Control of Finite State Machines on an Infinite Horizon I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk sensitive control of Markov processes in countable state space / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: OnActor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4346705 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analysis of recursive stochastic algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based optimization of Markov reward processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3997540 / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q127227136 / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/s0167-6911(01)00152-9 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W1990437501 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 09:58, 30 July 2024

scientific article; zbMATH DE number 1715426
Language Label Description Also known as
English
A sensitivity formula for risk-sensitive cost and the actor-critic algorithm
scientific article; zbMATH DE number 1715426

    Statements

    A sensitivity formula for risk-sensitive cost and the actor-critic algorithm (English)
    0 references
    0 references
    3 March 2002
    0 references
    Markov decision processes
    0 references
    risk sensitive control
    0 references
    reinforcement learning
    0 references
    actor-critic algorithms
    0 references
    parametric sensitivity
    0 references
    stochastic approximation
    0 references

    Identifiers