Multiclass classification with bandit feedback using adaptive regularization

From MaRDI portal

Revision as of 16:55, 1 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:1945036

Jump to:navigation, search

DOI10.1007/s10994-012-5321-8zbMath1260.68324OpenAlexW2142774925MaRDI QIDQ1945036

Koby Crammer, Claudio Gentile

Publication date: 28 March 2013

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/s10994-012-5321-8

zbMATH Keywords

online learning regret upper confidence bound

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Online algorithms; streaming algorithms (68W27)

Related Items

Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning, New bounds on the price of bandit feedback for mistake-bounded online multiclass learning

Uses Software

RCV1

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1945036&oldid=14379923"