Introduction to Multi-Armed Bandits

From MaRDI portal
Revision as of 17:32, 8 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:5213200

DOI10.1561/2200000068zbMath1478.68006arXiv1904.07272OpenAlexW4206275166WikidataQ126833114 ScholiaQ126833114MaRDI QIDQ5213200

Aleksandrs Slivkins

Publication date: 31 January 2020

Published in: Foundations and Trends® in Machine Learning (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1904.07272




Related Items (26)

Dynamic Learning and Market Making in Spread Betting Markets with Informed BettorsBayesian Exploration: Incentivizing Exploration in Bayesian GamesMultiplayer Bandits Without Observing Collision InformationMaximizing revenue for publishers using header bidding and ad exchange auctionsMulti-round cooperative search games with multiple playersOptimal activation of halting multi‐armed bandit modelsMulti-armed bandit-based hyper-heuristics for combinatorial optimization problemsOnline Resource Allocation with Personalized LearningRegret minimization in online Bayesian persuasion: handling adversarial receiver's types under full and partial feedback modelsOnline learning of network bottlenecks via minimax pathsMulti-armed bandits with censored consumption of resourcesA central limit theorem, loss aversion and multi-armed banditsConvergence rate analysis for optimal computing budget allocation algorithmsSemi-Supervised Node Classification via Semi-Global Graph Transformer Based on Homogeneity AugmentationUniversal regression with adversarial responsesControl-data separation and logical condition propagation for efficient inference on probabilistic programsEfficient and generalizable tuning strategies for stochastic gradient MCMCImproving Hoeffding's inequality using higher moments informationQuantum greedy algorithms for multi-armed banditsBallooning multi-armed banditsBayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacksMulti-armed bandit with sub-exponential rewardsUnnamed ItemReinforcement Learning Based Interactive Agent for Personalized Mathematical Skill EnhancementLearning in Repeated AuctionsBypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits Under Realizability




This page was built for publication: Introduction to Multi-Armed Bandits