An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems

Publication:2665165

DOI: 10.1016/J.AUTOMATICA.2021.109673
OpenAlex: W3159888357
MaRDI QID: Q2665165

Hyeong Soo Chang

Publication date: 18 November 2021

Published in: Automatica

Full work available at URL: https://arxiv.org/abs/2007.14550

zbMATH Keywords

learning theory multi-armed bandit constrained simulation-optimization

Mathematics Subject Classification ID

Artificial intelligence (68Txx)
Calculus of variations and optimal control; optimization (49-XX)
