Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (Q2006767)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards | scientific article | |
Statements
Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards (English)
12 October 2020
multi-armed bandit with covariates
delayed rewards
histogram method
strong consistency