Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (Q2006767)

From MaRDI portal
Language: English
Label: Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards
Description: scientific article

    Statements

    Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (English)
    12 October 2020
    multi-armed bandit with covariates
    delayed rewards
    histogram method
    strong consistency
