scientific article
From MaRDI portal
Publication:2834477
zbMath1392.62121MaRDI QIDQ2834477
Publication date: 22 November 2016
Full work available at URL: http://jmlr.csail.mit.edu/papers/v17/13-210.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
nonparametric regressioncontextual bandit problemregret boundupper confidence boundexploration-exploitation tradeoff
Nonparametric regression and quantile regression (62G08) Density estimation (62G07) Nonparametric tolerance and confidence regions (62G15) Applications of mathematical programming (90C90) Probabilistic games; gambling (91A60)
Related Items (2)
Statistical Inference for Online Decision Making via Stochastic Gradient Descent ⋮ Statistical Inference for Online Decision Making: In a Contextual Bandit Setting
This page was built for publication: