Good arm identification via bandit feedback (Q2425222): Difference between revisions
From MaRDI portal
ReferenceBot (talk | contribs) Changed an Item
Property / cites work
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
Normal rank
Property / cites work
Property / cites work: Kullback-Leibler upper confidence bounds for optimal sequential allocation / rank
Normal rank
Property / cites work
Property / cites work: Q3093383 / rank
Normal rank
Property / cites work
Property / cites work: Q2810758 / rank
Normal rank
Property / cites work
Property / cites work: A procedure for selecting a subset of size m containing the l best of k independent normal populations, with applications to simulation / rank
Normal rank
Property / cites work
Property / cites work: Asymptotically efficient adaptive allocation rules / rank
Normal rank
Property / cites work
Property / cites work: Q4626283 / rank
Normal rank
Revision as of 17:33, 19 July 2024
Language | Label | Description | Also known as |
---|---|---|---|
English | Good arm identification via bandit feedback | scientific article | |
Statements
Good arm identification via bandit feedback (English)
26 June 2019
thresholding bandits
multi-armed bandits
reinforcement learning
machine learning