Approximate stochastic annealing for online control of infinite horizon Markov decision processes (Q1937498): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/j.automatica.2012.06.010 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W1988071557 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4209222 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: New algorithms of the Q-learning type / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Adaptive Sampling Algorithm for Solving Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recursive Learning Automata Approach to Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A survey of some simulation-based algorithms for Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Asymptotically Efficient Simulation-Based Algorithm for Finite Horizon Stochastic Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditions for the uniqueness of optimal policies of discounted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the almost sure convergence of a general stochastic approximation procedure / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reinforcement Learning: A Tutorial Survey and Recent Advances / rank
 
Normal rank
Property / cites work
 
Property / cites work: Cooling Schedules for Optimal Annealing / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: OnActor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation methods for constrained and unconstrained systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4346705 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Stochastic Approximation Method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Introduction to Stochastic Search and Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous stochastic approximation and Q-learning / rank
 
Normal rank

Latest revision as of 06:17, 6 July 2024

scientific article
Language Label Description Also known as
English
Approximate stochastic annealing for online control of infinite horizon Markov decision processes
scientific article

    Statements

    Approximate stochastic annealing for online control of infinite horizon Markov decision processes (English)
    0 references
    0 references
    0 references
    0 references
    1 March 2013
    0 references
    0 references
    0 references
    0 references
    0 references
    algorithms
    0 references
    Markov decision process
    0 references
    stochastic approximation
    0 references
    simulation
    0 references
    0 references