The following pages link to Linearly Parameterized Bandits (Q3169099):
Displaying 31 items.
- Best arm identification in generalized linear bandits (Q2060547) (← links)
- Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit (Q2074307) (← links)
- Stochastic continuum-armed bandits with additive models: minimax regrets and adaptive algorithm (Q2091834) (← links)
- Online Collaborative Filtering on Graphs (Q2830757) (← links)
- Optimal Learning in Linear Regression with Combinatorial Feature Selection (Q2960366) (← links)
- Active Learning of Bayesian Linear Models with High-Dimensional Binary Features by Parameter Confidence-Region Estimation (Q3386414) (← links)
- (Q4558206) (← links)
- (Q4558552) (← links)
- Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback (Q4596721) (← links)
- Learning to Optimize via Information-Directed Sampling (Q4969321) (← links)
- Technical Note—A Note on the Equivalence of Upper Confidence Bounds and Gittins Indices for Patient Agents (Q4994155) (← links)
- A Bandit-Learning Approach to Multifidelity Approximation (Q5022495) (← links)
- (Q5054631) (← links)
- Ranking and Selection with Covariates for Personalized Decision Making (Q5084611) (← links)
- Dynamic Learning and Decision Making via Basis Weight Vectors (Q5095179) (← links)
- Online Resource Allocation with Personalized Learning (Q5106359) (← links)
- MNL-Bandit: A Dynamic Learning Approach to Assortment Selection (Q5129205) (← links)
- Online Decision Making with High-Dimensional Covariates (Q5130496) (← links)
- Online Network Revenue Management Using Thompson Sampling (Q5131540) (← links)
- Learning in Combinatorial Optimization: What and How to Explore (Q5144784) (← links)
- (Q5149014) (← links)
- A linear response bandit problem (Q5168867) (← links)
- Dynamic Pricing with Multiple Products and Partially Specified Demand Distribution (Q5244873) (← links)
- (Q5247113) (← links)
- Learning to Optimize via Posterior Sampling (Q5247618) (← links)
- Satisficing in Time-Sensitive Bandit Learning (Q5870357) (← links)
- Randomized allocation with arm elimination in a bandit problem with covariates (Q5965323) (← links)
- Technical note—Knowledge gradient for selection with covariates: Consistency and computation (Q6053135) (← links)
- A tractable online learning algorithm for the multinomial logit contextual bandit (Q6113379) (← links)
- Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems (Q6167036) (← links)
- Multi-armed linear bandits with latent biases (Q6198758) (← links)