scientific article; zbMATH DE number 6542809
From MaRDI portal
Publication:5744820
zbMath 1351.68236; MaRDI QID Q5744820
Thorsten Joachims, Adith Swaminathan
Publication date: 19 February 2016
Full work available at URL: http://jmlr.csail.mit.edu/papers/v16/swaminathan15a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
importance sampling ⋮ propensity score matching ⋮ empirical risk minimization ⋮ structured prediction ⋮ bandit feedback
Nonparametric robustness (62G35) ⋮ Classification and discrimination; cluster analysis (statistical aspects) (62H30) ⋮ Learning and adaptive systems in artificial intelligence (68T05)
Related Items
On Robustness of Individualized Decision Rules ⋮ Learning MAX-SAT from contextual examples for combinatorial optimisation ⋮ Constructing effective personalized policies using counterfactual inference from biased data sets with many features ⋮ Orthogonal statistical learning ⋮ Off-policy evaluation in partially observed Markov decision processes under sequential ignorability ⋮ Adversarial balancing-based representation learning for causal effect inference with observational data ⋮ Variational learning from implicit bandit feedback ⋮ Lessons on off-policy methods from a notification component of a chatbot ⋮ More Efficient Policy Learning via Optimal Retargeting ⋮ Unnamed Item ⋮ Learning When-to-Treat Policies ⋮ Unnamed Item