Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques (Q5166474): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1080/03081079.2014.883387 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2024928738 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning Algorithms for Markov Decision Processes with Average Cost / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-sensitive capacity control in revenue management / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4368722 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Time consistent dynamic risk measures / rank
 
Normal rank
Property / cites work
 
Property / cites work: Target-level criterion in Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based algorithms for Markov decision processes. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Variance-Penalized Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Percentile performance criteria for limiting average Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A risk-sensitive approach to total productive maintenance / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4393471 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Constrained Optimization for Average Cost Continuous-Time Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Minimum risk probability for finite horizon semi-Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-sensitive control with HARA utility / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-sensitive reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3491338 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4224236 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal threshold probability and expectation in semi-Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence results for single-step on-policy reinforcement-learning algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: The variance of discounted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Minimising a threshold probability in discounted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Minimizing risk models in Markov decision processes with policies depending on target values / rank
 
Normal rank

Latest revision as of 16:07, 8 July 2024

scientific article; zbMATH DE number 6309105
Language Label Description Also known as
English
Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques
scientific article; zbMATH DE number 6309105

    Statements

    Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques (English)
    0 references
    0 references
    27 June 2014
    0 references
    variance-penalized MDPs
    0 references
    dynamic programming
    0 references
    risk penalties
    0 references
    reinforcement learning
    0 references
    Bellman equation
    0 references

    Identifiers