Sleeping experts and bandits approach to constrained Markov decision processes (Q901196): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / arXiv ID
 
Property / arXiv ID: 1412.4898 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation algorithms for constrained optimization via simulation / rank
 
Normal rank
Property / cites work
 
Property / cites work: An exact iterative search algorithm for constrained Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based algorithms for Markov decision processes. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Non-randomized policies for constrained Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: ${Q}$-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Constrained Discounted Markov Decision Processes and Hamiltonian Cycles / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: Regret bounds for sleeping experts and bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Sample Average Approximation Method for Stochastic Discrete Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-Based Discrete Optimization of Stochastic Discrete Event Systems Subject to Non Closed-Form Constraints / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastically Constrained Ranking and Selection via SCORE / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sample average approximation of expected value constrained stochastic programs / rank
 
Normal rank

Latest revision as of 05:47, 11 July 2024

scientific article
Language Label Description Also known as
English
Sleeping experts and bandits approach to constrained Markov decision processes
scientific article

    Statements

    Sleeping experts and bandits approach to constrained Markov decision processes (English)
    0 references
    0 references
    0 references
    23 December 2015
    0 references
    Markov decision processes
    0 references
    sleeping experts and bandits
    0 references
    learning algorithm
    0 references
    constrained optimization
    0 references

    Identifiers