scientific article; zbMATH DE number 6542809

From MaRDI portal

Publication:5744820

Jump to:navigation, search

zbMath1351.68236MaRDI QIDQ5744820

Thorsten Joachims, Adith Swaminathan

Publication date: 19 February 2016

Full work available at URL: http://jmlr.csail.mit.edu/papers/v16/swaminathan15a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

importance sampling propensity score matching empirical risk minimization structured prediction bandit feedback

Mathematics Subject Classification ID

Nonparametric robustness (62G35) Classification and discrimination; cluster analysis (statistical aspects) (62H30) Learning and adaptive systems in artificial intelligence (68T05)

Related Items

On Robustness of Individualized Decision Rules ⋮ Learning MAX-SAT from contextual examples for combinatorial optimisation ⋮ Constructing effective personalized policies using counterfactual inference from biased data sets with many features ⋮ Orthogonal statistical learning ⋮ Off-policy evaluation in partially observed Markov decision processes under sequential ignorability ⋮ Adversarial balancing-based representation learning for causal effect inference with observational data ⋮ Variational learning from implicit bandit feedback ⋮ Lessons on off-policy methods from a notification component of a chatbot ⋮ More Efficient Policy Learning via Optimal Retargeting ⋮ Unnamed Item ⋮ Learning When-to-Treat Policies ⋮ Unnamed Item

Uses Software

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5744820&oldid=30503641"