Decomposing risk in an exploitation-exploration problem with endogenous termination time
From MaRDI portal
Publication:1708513
DOI10.1007/s10479-017-2610-4zbMath1384.90116OpenAlexW2751630626MaRDI QIDQ1708513
Publication date: 23 March 2018
Published in: Annals of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10479-017-2610-4
Cites Work
- Four proofs of Gittins' multiarmed bandit theorem
- Supply chain risk analysis with mean-variance models: a technical review
- Learning, risk attitude and hot stoves in restless bandit problems
- Adaptive control of constrained Markov chains: Criteria and policies
- Active learning. Monte Carlo results
- Markowitz Revisited: Mean-Variance Models in Financial Portfolio Analysis
- An Economic Index of Riskiness
- A Learning Approach for Interactive Marketing to a Customer Segment
- Oligopoly Models for Optimal Advertising When Production Costs Obey a Learning Curve
- Risk Aversion in the Small and in the Large
- MEAN–VARIANCE PORTFOLIO OPTIMIZATION WITH STATE‐DEPENDENT RISK AVERSION
- Stationary multi-choice bandit problems.
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item