Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (Q2006767)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards
scientific article

    Statements

    Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (English)
    0 references
    0 references
    0 references
    12 October 2020
    0 references
    multi-armed bandit with covariates
    0 references
    delayed rewards
    0 references
    histogram method
    0 references
    strong consistency
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references